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2-Azapurine compounds and their use 3 ^ Aug. 1999 



The present invention is directed to a nucleic acid binding compound comprising 
2-azapurines, a compound useful for the preparation of such compound, a binding 
product of this nucleic acid binding compound with a nucleic acid, a method for the 
determination of a nucleic acid using said compound, and several uses of 2-azapurine 
compounds. 

Some 2-azapurine compounds are known in the art. In Chemistry of Nucleosides 
and Nucleotides (Vol. 2, p. 288-297, and p. 319) there are shown some examples of 
2-azapurine nucleosides. However, there is no disclosure on 7-deaza-2-azapurine 
nucleosides. 

In Biochimica et Biophysica Acta, 520 (1978), p. 441-451, there is disclosed the 
enzymatic synthesis of 2-azaadenosine-5'-diphosphate and 2-aza inosine-5'- 
diphosphate and their polymerization to homopolymers using E.coli polynucleotide 
phosphoryiase. This method is complex and it is not capable of producing mixed 
sequences. Furthermore, there was reported on the capability of the homopolymers to 
form double- and triple-stranded complexes with other homopolymers. Those 
compounds only containing 2-azaadenosine as base have proved to be UV sensitive and 
labile. 

In J. Org. Chem. 1995, 60, 6262-6269, there is disclosed the synthesis and biophysical 
and biological properties of oligonucleotides containing 2-aza-2'-deoxyinosine. 
There is disclosure on a 2-aza inosine nucleoside, which is modified by a particular 
photochemically deavable protecting group at one of the ring nitrogen atoms. However, 
as can be seen from table 1 on page 6266, l M does not differentiate well between the 
natural nucleobases. As can be seen further, the stability of the base pair I^/G is 13 0 less 
than the regular base pair C/G and 9 0 less than the base pair A/T. The replacement of C 
by I M would therefore not be suitable for mimicking the stability of a A/T base pair. 
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In Liebigs Ann. Chem. 1990, 647-651, there are disclosed 2-azaadenine and a nucleoside 
of methylthioimidazotriazine as well as the corresponding methoxy compound. No 
phosphates or phosphoramidites are disclosed Furthermore, there is no disclosure on 
how to prepare oligonucleotides containing 2-aza-adenosine at specific positions. Their 
5 hybridization behaviour is not disclosed. Furthermore, there is no disclosure on 2-aza- 
2' -deoxyadenosinetriphosphate. 

The present invention is particularly useful in nucleic acid determinations, for example 
in analytics, especially in the field of health care. Nucleic acids have been found to be 

10 useful analytes for the determination of the presence or absence of genes or micro- 
organisms in human body fluids, food or environment Nucleic acid analysis has found 
widespread use after the introduction of nucleic acid amplification, like the Polymerase 
Chain Reaction (PCR, see US-A-4,683,202). Thus, sufficient amounts of nucleic acids 
are available from each sample. The nucleic acids can be determined from this 

1 5 pretreated sample using a variety of different techniques, dependent from the particular 
purpose. Most assays require the use of a probe which is either immobilized or immobi- 
lizable or is labelled by attachment of one or more reporter groups. A reporter group 
has the characteristics to be itself capable to be determined or it can be reacted with 
reagents that make the probe determinable via said reporter group. Thus, for example, 

20 probes that are labelled by reporter groups can be determined, as can be hybrids that 
contain the probe and a nucleic acid to be determined. In case of immobilized probes, 
the hybrid between the probe and the nucleic acid to be determined is determined at the 
solid phase to which the probe is bound. In a particular form of assays, not only one 
nucleic acid having a specific sequence, but a large number of nucleic acids of different 

25 sequence is determined. For this purpose, the probes are immobilized in tiny spots in an 
array on a flat surface such as a glass chip (EP-A-0 476 014 and TIBTECH (1997), Vol. 
15, 465-469). 

The basic principle of using oligonucleotide arrays was first proposed in the late 1980s 
30 when the concept of determining a DNA sequence by hybridization to a comprehensive 
set of oligonucleotides (SBH, sequencing by hybridization) was developed. 

There are many proposals to include modified or non-natural heterocyclic groups 
instead of the natural nucleobases. Examples of such non-natural groups are 7-deaza- 
35 dGTP which, when introduced into a nucleic acid replacing dGTP reduces band 
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compressing in sequencing gels (EP-B-0 286 028). 

Nucleic acid determinations generally suffer from the problem that the base pairing 
possibilities between the natural bases A and T and C and G have different stability. 

5 This can be attributed to the different capability of the bases to form hydrogen bonding. 
Thus, the dA-dT- base pair has two hydrogen bridges, while the dG-dC- base pair has 
3 hydrogen bridges. This results in different melting temperatures (Tm) of hybrids, 
depending on the GC content. The higher the GC content, the higher the Tm. In 
routine nucleic acid analysis, however, there would be the wish to equalize the Tm for 

10 nucleic acids of the same length, or even independent from the length of the nucleic 
acid or the binding region in order to be in the position to apply similar hybridization 
conditions for all assays. This is particularly necessary for assays using arrays, as on such 
arrays the hybridizing conditions for each probe must be identical. 

15 One solution was the use of low hybridization temperatures. Under such conditions, 
many nucleic acids having a low degree of base sequence complementarity will bind 
to the probe. This is called unspecific binding which does not allow discrimination 
between similar sequences. 

20 Another proposal was directed to the use of chemical reagents in the hybridization 
mixture, for example the addition of tetramethylammonium chloride (TMAC). This 
reagent reduces the difference between the stability of dG-dC and dA-dT basepairs but 
the effect is insufficient for short oligonucleotides. Further the addition of salts such as 
TMAC may not be wellcome as it complicates optimization of the assay. 

25 

Another proposal was directed to the use of different concentrations of each different 
(immobilized) probe in one assay. This was found to be technically complex if not 
impossible on a chip surface. 

30 As a further option the substitution of ribonucleotides in an oligonucleotide composed 
of deoxyribonucleotides, and vice versa was applied for the adaptation of DNA stability, 
Hoheisel (1996), Nucleic Acids Res. 24, 430-432. 

All proposals known now have some disadvantages. Therefore, there is still a need to 
35 provide probes the Tm of which is not very dependent from their GC content. 
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FIG 1 shows how z 2 Aa is capable of base pairing with a dG in another strand of a nucleic 
acid. 

5 In FIG 2 there are shown different compounds of the invention, like 2-aza-2'- 

deoxyadenosine (2), the corresponding triphosphate (11), a phosphoramidite for the 
introduction of 2-aza-2'-deoxyadenosine into oligonucleotides during conventional 
chemical automated synthesis (10 b) and a H-phosphonate (12). 

10 FIG 3 shows compounds useful in the present invention. The purine numbering for 
compound 2 and the systematic numbering for compound 3 is given. 

FIG 4 shows a route for synthesis of 2-aza-2'-deoxy-adenosine. 

15 FIG 5 shows compounds of the invention. 

FIG 6 shows a comparison of the binding of z 2 A<i with dG and dG with isoG d in 
antiparallel binding. 

20 FIG 7 shows that z 2 Aa can also bind in a parallel mode to isoG d , can bind to dC in 
parallel mode when protonated, and to isoC d in antiparallel mode if protonated. 

The subject of the present invention is a nucleic acid binding compound comprising a 
backbone, said backbone having attached heterocyclic groups capable of base pairing to 
25 natural nudeobases at least one of said heterocyclic groups being one of the naturally 

occurring nucleobases characterized in that at least one other of said heterocyclic groups 
is a group of the general formula I 



30 
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Formula I 
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N 



R 1 

1 



X 



w z 

s 
! 

D 



wherein 



10 



15 



20 



25 



W is selected independently from X, Y and Z from the group consisting of N 
and CR 2 , 

Z is selected from the group consisting of N and C with the proviso that 
ifZisN,then 

X independently from W and Y is selected from the group 
consisting of N and CR 3 , and 

Y independently from W and X is selected from the group 
consisting of N and CR 4 , 

and the bond between X and Y is a double bond and the bond 
between Y and Z is a single bond, and 

• ifZisQthen 

X is NR 33 , and 

Y is selected from the group consisting of N and CR 4 and 
the bond between Z and Y is a double bond and the bond between X 

and Y is a single bond, 

R 1 , R 2 , R 3 and R 4 are independently selected from the group consisting of -H, - 

halogen, -OR 13 , -SR. 19 , -(Ci-Cio)-alkyl, -(C 2 -Ci 0 )-alkenyl, -(C 2 -Ci 0 )-alkynyl, 
-N0 2 , -NR^ 6 , -cyano, and -C(=0)R u , 
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R" is sdected from the group consisting of -OH, -(Q-QO-alkoxy, -(Q-Cb)- 

aryloxy, and NHR 12 , 
R 5 , R 6 , R 12 , R 13 , R 19 and R 33 are selected independently from the group consisting 
of -H, -(Q-C 10 )-alkyl, -(C2-C 10 )-alkenyl, -(C 2 -C 10 )-alkinyl > -(Q-CaJ-aryl, a 
protecting group and a reporter group, 
r and s are independently of each other an integer of 1 to 18, 
D is the position of attachment of the group to the rest of the nudeic add 

binding compound, and 
said alkyl, alkenyl and alkynyl being unsubstituted or substituted by one or more 
moieties sdected from the group consisting of -halogen, -SH, -S-CCi-QO-alkyl, - 
(Q-QJ-alkoxy, -OH, -NR 5 R 6 , -COR 11 , -NH-CONR 5 R 6 , -NH-CSNR^ 6 and 
•[0-(CH 2 ) r ] s -NR¥. 

By way of example, in the following there is given an explanation of the advantageous 
1 5 property of the compounds of the invention by showing them at the example of 
2-aza-2'-deoxyadenosine (2, z 2 Ad) which forms specifically stable base pairs with 
2'-deoxyguanosine (dG) but much less stable base pairs with 2 , -deoxythymidine (dT), 
2'-deoxycytidine (dC) and 2'-deoxyadenosine (dA). The new base pair (^Aa-dG) is of 
analogous stability as a regular dA-dT base pair. 
20 - 

In order to equalize Tm, in a nudeic add binding compound one or more C in a strand 
complementary to a G in the nucleic add to be determined could be replaced by a z 2 Ad. 
The oligonucleotide would then bind specifically to the target sequence containing dG 
opposite to z 2 Ad but with the stability of a dA-dT and not a dG-dC base pair. This 

25 general principle of course is not limited to z 2 A d> as bases showing the same 

characteristics in the 6-membered ring would be expected to have the same properties 
based on the above explanation due to their containing the 2-azapurine structure. 
Particularly, the farer the part of the heterocyclic group from the part partidpating in 
the base pairing, the more tolerant will the oligomer be over modifications in the 

30 chemical structure, for example the attachment of groups to this part of the heterocyclic 
rings. In the following, when reference is made to the modified base (of the invention), 
there is made reference to a heterocyclic group according to general formula I. 
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The broken line in formula I indicates that there are several possibilities, depending 
upon the definitions of X, Y and Z, to localize a double bond. It is apparent to the man 
skilled in art that the choice of a specific definition of Z will require a double bond to be 
either between Z and Y or between X and Y. It is further evident that there will be not 
5 each a double bond between X and Y and Y and Z. 

Halogen means a fluoro, chloro, bromo or iodo group. The most preferred halogen 
groups are -CI and -Br. 

10 Alkyi groups are preferably chosen from alkyi groups containing from 1 to 10 carbon 
atoms, either arranged in linear, branched or cyclic form. The actual length of the alkyi 
group will depend on the steric situation at the specific position where the alkyi group is 
located. If there are steric constraints, the alkyi group will generally be smaller, the 
methyl and ethyl group being most preferred. All alkyi, alkenyl and alkynyl groups can 

15 be either unsubstituted or substituted. Substitution by hetero atoms as outlined above, 
will help to increase solubility in aqueous solutions. 

Alkenyl groups are preferably selected from alkenyl groups containing from 2 to 10 
carbon atoms. For the selections similar considerations apply as for alkyi groups. They 
20 also can be linear, branched and cyclic. The most preferred alkenyl group is the ethylene 
group. 

Alkynyl groups have preferably from 2 to 10 carbon atoms. Again, those carbon atoms 
can be arranged in linear, branched and cyclic manner. Further, there can be more than 
25 one triple bond in the alkynyl group. The most preferred alkynyl group is the 3- 
propargyl-group. 

Alkoxy groups preferably contain from 1 to 6 carbon atoms and are attached to the rest 
of the moiety via the oxygen atom. For the alkyi group contained in the alkoxy groups, 
30 the same considerations apply as for alkyi groups. The most preferred alkoxy group is 
the methoxy group. 

Aryloxy groups preferably contain from 6 to 20 carbon atoms. Those carbon atoms may 
be contained in one or more aromatic rings and further in side chains (for example, 
35 alkyi chains) attached to the aromatic moiety. Preferred aryloxy group are the phenoxy 
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and the benzoxy group. 

Preferred O-protecting groups in R 14 are the aroyi groups, the acyl groups and the silyi 
groups. Among these most preferred is the benzoyl group. 

5 

Preferred silyl groups are the trialkyisilyl groups, like triethylsilyL 

Any atom in the definitions within the formulae presented herein is not limited to a 
specific isotope. Thus, a phosphorous atom (P) can either mean the regular 3I P or the 
10 radioactive 32 P or a mixture thereof. The same applies for hydrogen (H/D/T), carbon 
(C), iodine (CI, Br, I) and nitrogen (N). 

Preferred group -NR^ 6 in the definition of R 2 , R 3 and R 4 is the -NH 2 group. In this 
case, it is evident that during chemical synthesis of compounds containing such group 
15 of formula I one of the hydrogen atoms of this amino group might be protected by 

suitable amino protecting group. Such protecting groups are generally known to a man 
skilled in the art. 



20 



25 



30 



The same applies for the definitions of R 1 . 

During chemical synthesis, any groups -OH, -SH and -NH 2 (including those groups in 
reporter groups) should be protected by suitable protecting groups. Further, during 
chemical synthesis, the compound will be attached for convenience to a solid phase. In 
these cases, the definitions of the substituents given above will be selected accordingly. 

A protecting group is a chemical group that is attached to a functional moiety (for 
example to the oxygen in a hydroxyl group, replacing the hydrogen) to protect the 
functional group from reacting in an undesired way. A protecting group is further 
defined by the fact that it can be removed without destroying the biological activity 
of the molecule formed, here the binding of the nucleic acid binding compound to a 
nucleic acid. Suitable protecting groups are known to a man skilled in the art. Especially 
preferred protecting groups for example for hydroxyl groups at the 5'-end of a 
nucleotide or oligonucleotide are selected from the trityl group, for example 
dimethoxytrityl. 



35 
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Preferred protecting groups at exocyclic amino groups in formula I are the acyl groups, 
most preferred the benzoyl group (Bz), phenoxyacetyl or acetyl or formyl, and the N,N- 



diall^formamidine group, preferentially the dimethyl-, diisobutyl-, and the di-n- 
butylformamidine group. 

5 

The nucleic acid binding compound according to the invention preferably has a length 
of less than 100 subunits, more preferably of from 10 to 30 subunits. In order to be 
active as nucleic acid binding compound, the substituents should be chosen such that 
hydrogen bonds to heterocyclic groups at the nucleic acid to be bound are enabled, 
10 preferably by Watson Crick base pairing and/or in the way as disclosed in figure 1. 

Compounds in which the substituents do not enable such preferred hydrogen bonding, 
can be useful as intermediates for the preparation of nucleic acid binding compounds. 
Preferred nucleic acid binding compounds of the invention are those which are 
chemically synthesized. 

15 

If the nucleic acid binding compound is to be used as a probe for the determination 
of a nucleic acid, or any other identification of the compound or the nucleic acid is 
intended, any of the substituents are selected such as to contain a reporter group. While 
as many reporter groups can be attached as useful to label the nucleic acid compound 

20 sufficiently, it is preferred to attach only a limited number of reporter groups to a single 
subunit, such that recognition of nucleic acids, affinities to nucleic acids and solubility 
is not affected such that the probe would not be useful in hybridization assays. In a very 
preferred case, there will be only from 1 to 4, most preferably 1 or 2 or most preferred 
only one reporter group in each nucleic acid binding compound. There are formats for 

25 the nucleic acid determination which require more than one reporter group attached to 
the probe. An example for such formats is disclosed in WO92/02638. In this case, one of 
the reporter groups will be a fluorescence emitter, while the other is a fluorescence 
quencher. 

30 Reporter groups are generally groups that make the nucleic acid binding compound 
as well as any nucleic acids bound thereo distinguishable from the remainder of the 
liquid, i.e. the sample (nucleic acid binding compounds having attached a reporter 
group can also be termed labelled nucleic acid binding compounds, labelled probes or 
just probes). This distinction can be either effected by selecting the reporter group from 

35 the group of directly or indirectly detectable groups or from the groups of immobilized 
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or immobilizable groups. Directly detectable groups are for example fluorescent 
compounds, like fluorescein and its derivatives, like hexachlorofluorescein and 
hexafluorofluorescein, rhodamines, psoralenes squaraines, porphyrines, fluorescent 
particles, bioluminescent compounds, like acridinium esters and luminol, or the 

5 cyanine dyes, like Cy-5. Examples of such compounds are disclosed in EP 0 680 969. 
Further, spin labels like TEMPO, electrochemically detectably groups, ferrocene, 
viologene, heavy metal chelates and electrochemiluminescent labels, like ruthenium 
bispyridyi complexes, and naphthoquinones, quencherdyes, like dabcyl, and nuclease 
active complexes, for example of Fe and Cu, are useful detectable groups. Indirectly 

0 detectable groups are groups that can be recognized by another moiety which is directly 
or indirectly labelled. Examples of such indirect detectable groups are for example 
haptens, like digoxigenin or biotin. Digoxigenin for example can be recognized by 
antibodies against digoxigenin. Those anibodies may either be labelled directly or can be 
recognized by labelled antibodies directed against the (antidigoxigenin) antibodies. 

5 Formats based on the recognition of digoxigenin are disclosed in EP-B-0 324 474. 

Biotin can be recognized by avidin and similar compounds, like streptavidin and other 
biotin binding compounds. Again, those compounds can be labelled directly or 
indirectly. 

0 The reporter group can further be a nucleotide sequence which does not interfere with 
other nucleotide sequences in the sample. The sequence can therefore be specifically 
recognized by nucleotide containing a complementary sequence. This nucleotide 
sequence can be labelled directly or indirectly or can be immobilizable or immobilized. 
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25 A reporter group can further be a solid phase. Attachment of the nucleic acid binding 
compound with solid phase can be either directly or indirectly as pointed out above for 
the detectable group. 

Direct labelling can be effected by covalent coupling of a nucleic acid binding 
30 compound to a reactive group on the solid phase, i.e. preferably via a linker. Indirect 
labelling can be made similar as disclosed above for the detectable groups. Preferably, 
indirect attachment is non-covalently by biospecific interactions, for example those 
selected from the group of hapten-antibody, vitamin-receptor and nucleic acid- 
complementary nucleic acid. Again, those interactions and their use in nucleic acid 
35 assays is known to a man skilled in the art. 
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Solid phases that are useful for immobilization of the probe according to the invention 
are preferably selected from the group of polystyrene, polyethylene, polypropylene, glass 
and T1O2. The formats of such solid phases can be selected according to the needs of the 
5 instrumentation and format of the assay. For example, a solid phase may assume 
the form of a bead or a vessel. 

The term reporter group and the specific embodiments preferably include a linker 
which is used to connect the moiety intended to be used (the actual solid phase or the 
10 fluorophoric moiety, to the position of attachment as the reporter group. The linker will 
provide flexibility such that the nucleic acid binding compound can bind the nucleic 
acid sequence to be determined without major hindrance by the solid phase. Linkers, 
especially those that are not hydrophobic, for example based on consecutive ethylenoxy 
units, for example as disclosed in DE 3943522 are known to a man skilled in the art. 

15 

From the above explanation, it becomes clear that the invention would still work, even 
if the backbone of the probe is not an oligonucleotide in the strict sense. There were 
described in the last years nucleic binding compounds that have similar properties like 
oligonucleotides, but differ in their backbone. The backbone is generally considered to 

20 be the part of the nucleic acid binding compound that bears the bases, mosdy in linear 
manner, bound to identical or not identical subunits. The most popular backbone is the 
naturally occurring sugar phosphate backbone of nucleic acids (containing either 
ribonucleoside subunits (RNA), deoxyribonucleoside subunits (DNA) or peptide 
nucleic acid subunits (PNA)). Therefore, in a preferred embodiment, the backbone 

25 comprises sugar and phosphate moieties. In a further preferred embodiment, the sugar 
configuration is selected from the group consisting of the ct-D-, p-D-, a-L- and (3-L- 
configurations, most preferred the compound contains at least one 2 , -deoxy-(5-D- 
erythro-pentofuranosyl moiety or one p-D-ribofuranosyl moiety. 

30 Preferred, D is the glycosid C- 1 of a sugar moiety of the compound according to the 
invention. Preferred compounds of formula VI are those wherein R 1 is NH 2 > W is N, Z 
is N, Y is C, X is N and R 14 is H. 

Further preferred nucleic acid binding compounds contain at least one group of 
35 formula I, wherein R l is the group -NR 2 ^ 21 , which are either 2-aza-adenosine or 
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derivatives thereof. As derivatives of 2-aza-adenine there are considered here 
compounds that provide hydrogen bonding via the same atoms as 2-aza-adenine 
to G and dG in a nucleic acid bound to the nucleic acid binding compound of the 
invention. The most preferred group of formula I is 2-aza-adenine, bound to the 
5 backbone via the N 9 atom. Those groups both discriminate clearly between the natural 
nucleobases and in addition provide a very similar stability as the A-T base pair. 

The nucleic acid binding compound will be constructed such that it contains a 
nucleobase sequence which is substantially complementary to the nucleic acid to be 

10 determined or the nucleic acid to which it is intended to be bound by base pairing. 
As those nucleic acids will usually contain at least once any of the naturally occurring 
nucleobases Ade, Cyt, Gua and Thy or Ura, the nucleic acid binding compound 
according to the invention will also contain any of those four bases. However, 
acccording to the invention, at least one of the heterocyclic groups in a position of the 

15 nucleic acid binding compound located vis-a-vis the G in the nucleic acid to be 

determined as replaced by the heterocyclic base of formula I. If there is more than one G 
in the sequence to which the nucleic acid binding compound is intended to be 
hybridized on the nucleic acid, preferably as many C*s in the nucleic acid binding 
compound are chosen to be heterocyclic groups of formula I as necessary to provide the 

20 Tm as intended. 

However, the nucleic acid binding compounds of the invention display the same base 
pairing selectivity also versus other heterocyclic groups in the position located vis-a-vis 
the position of the nucleic acid binding compound at which the group of formula I is 
25 located, especially versus derivatives of G, for example c 7 Gd, z 8 c 7 G d and z 8 G d . In these 
nominations, c 7 means that in the 7-position there will be a carbon atom and z 8 will 
correspondingly mean that in the 8-position there will be a nitrogen atom. Further, 
derivatives of G that can be recognized according to the invention, are nucleobases G 
which are labelled by the attachment of detectable groups, and iso G d . 

30 

The nucleic acid can also contain a heterocyclic group of formula I itself. The 
corresponding base on the nucleic acid binding compound will preferably be selected 
such as to base pair with this group, for example to be dG. The nucleic acid can contain 
natural and/or non-natural bases, for example 7-deaza-dGTP. Thus, the term nucleic 
35 acid will be construed in the present invention very broadly. Nucleic acids having mixed 
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base sequences being preferred. 

The nucleic acid binding compound according to the invention will bind to nucleic 
acids preferably in the antiparallel mode. However, by carefully selecting the 
nudeobases of a nucleic acid and/or of the nucleic binding compound, the binding 
can also be forced to be in the parallel mode. Parallel hybridization of nucleic acids 
containing iso-C and iso-G are for example disclosed in EP 0 624 161. 
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Preferred nucleic acid binding compounds are those, wherein the backbone comprises 
one or more moieties of the general formula II 



Formula II 




wherein 



15 



20 



25 



A 
L 
T 
U 



V 
R 14 



R 



22 



is selected from the group consisting of O, S and N-(Ci-Cio)-alkyi, 

is selected from the group consisting of oxy, sulfanediyl and -NR 22 -, 

is selected from the group consisting of oxo, thioxo and selenoxo, 

is selected from the group consisting of -OH, -O-reporter group, -SH, -S 

reporter group -SeH, -(Ci-Qo)-alkoxy, (CrQo)-alkyl, -(Q-C^-aryl, -(Q>- 

C 14 )-aryl-(Ci-C 10 )-alkyl, -NR 23 R 24 , and • 

-0-(C 1 -C 10 )-alkyl-0-(C 1 -Cio)-alkyl-R 25 , or wherein -NR^R 24 can together 

with N be a 5-6-membered heterocyclic ring, 

is selected from the group consisting of oxy, sulfanediyl or -NR 22 -, 

is selected from the group consisting of -H, -OH, -(Q-QcO-alkoxy, 

-(C 2 -Ci 0 )-alkenyloxy, -halogen, -azido, -O-allyl, -O-alkinyi, and -NH 2 , 

is independently selected from the group of-H and -(Ci-Cio)-alkyl, 
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R 23 and R 24 are independently selected from the group consisting of -(Ci-Qo)- 
alkyl, -(Q-C 2 o)-aryl, -(C 6 -C 14 )-aryl-(C 1 -Ci 0 )-alkyi, -(Q-C 6 )-alkyl- 
[NH(CH 2 ) c ]d-NR 26 R 27 and a reporter group, 

R 25 is selected from the group consisting of -H, -OH, -halogen, -amino, 
5 -(Ci-Ci 8 )-alkylamino, -COOH, -CONH 2 and -COO(Ci-C 4 )-alkyl and a 

reporter group, 

R 26 and R 27 are independently selected from the group consisting from -H, 

-(Ci-C 6 )-alkyl, and -(Q-C 4 )-alkoxy-(Ci-C 6 )-alkyl and a reporter group, 
c is an integer from 2 to 6, 
10 d is an integer from 0 to 6, and 

B is a moiety of formula I, 

wherein any alkyl, alkenyl and alkynyl can be substituted or unsubstituted, 
15 and any salts thereof. 

The preferred definitions of the groups as defined under formula I apply to formula II 
and the following formulae, if not indicated otherwise. 

20 A preferred subject of the invention is therefore a nucleic acid binding compound as 
outlined above, wherein the backbone comprises one or more moieties of the general 
formula III 



Formula III 




25 wherein 

A is selected from the group consisting of O, S and N-(C 1 -C 6 )-alkyl, 
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M is selected from the group consisting of 0x7, sulfanediyl, -NR -, -(Ci-Qo)- 
alkyi-, or -O-(C r Ci 0 )-alkyi-O-, and -S-(Q-Cio)-alkyi-0- and -NR 22 - 
(CrQ)-alkyl-O-, 

R 22 is selected from the group of -H, -(Ci-Cio)-alkyl, a protecting group and a 
reporter group, 

R 14 is selected from the group consisting of -H, -OH> -(Ci-Cio)-alkoxy, 

-(C2-Cio)-alkenyloxy > -(C 2 -Qo)-alkynyloxy, -halogen, -azido, SH, -(C1-Q0)- 
alkylmercapto and -NH2, 

R 15 is selected from the group consisting of -H, -(Q-C^-alkyl, -(C2-C10)- 
alkenyl, -(C 2 -Ci 0 )-alkynyl, -(C2-Cio)-alkyl-carbonyl, -(C 3 -Ci 9 )-alkenyl- 
carbonyl, -(C 3 -Ci 9 )-alkynyl-carbonyl, -(C 6 -Ci 4 )-aiyHCi-Cio)-alkyi, a solid 
phase and a group of formula IV 



Formula IV | 

U - p - R29 

II 
T 



wherein 

T is selected from the group consisting of oxo, thioxo and selenoxo, and 

U is selected from the group consisting of -OH, -O-reporter group, -SH, -SeH, 

-(Ci-C 10 )-alkoxy, -(Q-Cio)-alkyl, -(Q-C^-aryl, -(C 6 -C u )-aryi-(Ci-Cio)- 

alkyl, -NR 23 R 24 , and -O-(Ci-Ci 0 )-alkyl-O-(Ci-Ci 0 )-alkyl-R 25 , or wherein 

NR 23 R 24 can together with N be a 5-6-membered heterocyclic ring, 
R 23 and R 24 are independently selected from the group consisting of -(C1-Q0)- 

alkyl, -(Q-C 20 )-aryl, -(C 6 -Ci 4 )-aryl-(C 1 -Cio)-alkyl, -(Ci-QJ-alkyi- 

[NH(CH 2 ) c ]d-NR 26 R 27 , 
R 25 is selected from the group consisting of -H, -OH, -halogen, -amino, 

-(Ci-Ci 8 )-alkylamino, -COOH, -CONH 2 and -COO(d-C 4 )-alkyl, 
R 26 and R 27 are independendy selected from the group consisting from -H, 

-(Q-C 6 )-alkyl, and -(C r C 4 )-aIkoxy-(Ci-Q)-alkyi 
R 29 is selected from the group consisting of -OR 30 and -SR 30 , 
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R is selected from the group consisting of -H, -(C r Ci 0 )-alkyl, -(C 2 -Ci 0 )- 
alkenyl, -(C 6 -C 2 2)-aryl, a protecting group, a solid phase and a reporter 
group 

B is the link to a moiety of formula I, 
and any salts thereof. 



10 



15 



20 



25 



30 



35 



For the definitions and preferences the particulars apply as outlined for the substituents 
under formulae I and II, if not specified otherwise specifically for formula III. 

Preferably, in compounds of formula II, R 14 is hydrogen. Preferred definition of L is 
oxy. Preferred definition of U is -OH and -O-reporter group. Preferred definition of V 
is oxy. Preferred definition of c is an integer from 2 to 4, and of d an interger from 0 to 
2. 

Compounds of formula II are especially suited to contain the heterocyclic moiety of the 
invention as an integrated part (preferably not at one of the termini) of the nucleic acid 
binding compound. 

The group NR^R 24 is preferably selected from the group consisting of dialkyiamino 
groups. In case of this group together with the forming of 5- or 6-membered 
heterocyclic ring, it assumes preferably the definition of pyrrolidinyl or piperidinyl. 

Preferred aryl group is the phenyl or naphtyl moiety, either unsubstituted or substituted 
by one or more of amino, -aminoalkyl, -O-(Q-Ci 0 )-alkyl, -S-(Ci-C, 0 )-alkyl, -(C,-C l0 )- 
alkyl, sulfonyl, sulfenyl, sulfinyi, nitro and nitroso. Most preferred aryl group is the 
phenyl group. Preferred arylalkyl group is the benzyl group. The preferred alkylamino 
group is the ethylamino group. The preferred -COCKQ-Q) alkyl group contains one or 
two carbon atoms in the alkyl moiety (methyl or ethyl esters). 

Nucleic acid binding compounds, wherein the group of formula I is attached to submit, 
for example the nucleotide, at the 3'-terminus of the compound, are useful either as 
starting compound for longer compounds or/and as end-labelled probes. This group of 
compounds is especially preferred because the terminal position of probes generally is 
the most tolerant in view of attachment of chemical moieties. 
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A preferred subject of the invention is a nucleic acid binding compound as outlined 
above comprising a backbone moiety of the formula V 



Formula V 




t— 1 R14 

wherein 

A is selected from the group consisting of O, S and N-(Q-C6)-alkyl, 

M 1 is selected from the group consisting of oxy, sulfanediyl, -NR 22 -, -(Q-Cio)- 

alkyi, or -O-(Ci-Ci 0 )-alkyl-O-, and -S-(Ci-Ci 0 )-alkyl-O- and -NR 22 -(Q- 

C 6 )-alkyl-0-> 

R 22 is selected from the group of -H, a protecting group, a reporter group and 
-(Q-Qo)-alkyl, 

R 14 is selected from the group consisting of ~H> -OH, -(Q-Cio)-alkoxy, 

-(C 2 -Cio)-alkenyloxy, -(C 2 -Ci 0 )-alkynyioxy, -halogen, azido, -SH, -S-(Q- 
C 6 )-alkylmercapto and NH 2 , 

R 16 is selected from the group consisting of -H, -(Q-Q^alkyl, -(QrCi 8 )- 
alkenyl, -(C 2 -Ci 8 )-alkynyl, -(Q-Qs^alkyl-carbonyl, -(C 3 -Ci 9 )-alkenyl- 
carbonyl, -(C 3 -C l9 )-aUcjmyl-arbonyl, -(Q-Ci^-aryi-CCi-CgJ-alkyl, a 
protective group or a compound of formula IV 



Formula IV j 

U- P- R 29 



wherein 



T 
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U is selected from the group consisting of -OH, -SH, -SeH, -(Ci-Cio)-alkoxy, 

-(Ci-Cio)-alkyi, -(C 6 -C 22 )-aryl, -(C 6 -Ci 4 )-aryl-(C 1 -Qo)-alkjd, -NR^R 24 , and 

-OKCrdoJ-alkyl-O^Q-CioJ-alkyl-R 25 , wherein NR^R 24 can together with 

N be a 5-6-membered heterocyclic ring, 
R 23 and R 24 are independently selected from the group consisting of -(Ci-C 10 )- 

alkyl, -(Q-C 20 )-aryl, -(C6-Ci4)-arjd-(Ci-Cio)-alkyl, -(d-Gd-alkyl- 

[NH(CH 2 ) c ] d -NR 26 R 27 , 
R 25 is selected from the group consisting of -H, -OH, -halogen, -amino, 

-(Q-Ci 8 )-alkylamino, -COOH, -CONH 2 and -COOCQ-QJ-alkyl, 
R 26 and R 27 are independently selected from the group consisting from -H, 

-(Ci-C 6 )-alkyl, and -(Ci-C^-alkoxy-Cd-QJ-alkyl 
R 29 is selected from the group consisting of -OR 30 and -SR 30 , 
R 30 is selected from the group consisting of -H, -(Ci-Cio)-alkyi, -(C 2 -Qo)- 

alkenyl, -(C 6 -C 22 )-aryl, a protecting group, a solid phase and a reporter 

group, and 
B is the link to a moiety of formula I, 



wherein any alkyl, alkenyl and alkynyl can be substituted or unsubstituted, 



20 



and any salts thereof. 



25 



30 



35 



Those compounds are compounds that can be used as 5 f -terminally labelled probes. 
Regarding the definitions of the substituents, the definitions as given above apply if not 
indicated otherwise. 

A very preferred compound is a compound of formula V, wherein M is O, R 16 is H and 
R 14 is selected from the group consisting of hydrogen and hydroxyl. 

The backbone of the nucleic acid binding compound has the function to bear the base 
pairing heterocydes such that the compound can bind to a nucleic acid having a 
complementary sequence. Preferably, the degree of complementarity in the naturally 
occurring bases will be in the range from 70 % up to 100 % in a stretch of bases in a 
region effecting binding, compared to the stretch of same length in the region of the 
nucleic acid to be bound. Deletions and insertions of subunits in each sequence will 
therefor, in this calculation, be counted as gaps until the next fitting base and thus 
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reduce complementarity by as many bases as the gap contains. 

Preferred backbone contains sugar-phosphate moieties. From these, deoxy sugar 
containing backbones are further preferred. 

5 

Each moiety in the backbone bearing a moiety capable of base pairing to a nucleic acid 
of complementary sequence, including the moieties of the invention, are termed a 
subunit. Compounds are known that have backbones mixed of different kinds of 
subunits. Recently, a new kind of non-natural nucleic acid binding compounds was 
10 described. They are termed Peptide Nucleic Acids (PNA), as they contain at least one 
peptide bond between the subunits (WO 92/20702). The nucleic acid binding 
compound of the present invention can have any length. However, due to the 
convenience of chemical synthesis, compounds of a length of less than 100, more 
preferably from 10 to 30 subunits, for example nucleosides, are preferred. 

15 

The nucleic acid binding compound of the present invention can be prepared analogous 
to known methods. 

In a first option which is particularly suitable for short compounds, the compounds are 
20 produced by chemical synthesis (multistep oligomerisation) using chemically activated 
derivatives of the subunits (monomers), at least one of them containing the modified 
base of the invention. Preferably, reactive groups in the monomers that are not involved 
in the actual reaction step of the oligomerisation reaction are protected using an 
appropriate protecting group. Such protecting groups are well known in the art and 
25 there will be no major change in protection and synthesis strategy if the groups of the 
invention are used. 

An activated subunit is a subunit containing a substituent especially suited for chemical 
reaction with a predetermined substituent in another subunit, on the surface of a solid 
30 phase or in an oligomer formed of subunits. Such especially suited subunits are 

preferably selected from the group consisting of phosphoramidites, phosphonates (like 
methylphosphonates), phosphotriesters, phosphothioates, phosphodithioates, 
boranophosphates (see Chem. Commun. 1999, 1517-1518), phosphate methyl esters, 
phenylphosphonates and phosphate ethyl esters. 

35 
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The predetermined substituents are preferably selected from the group of -NH 2 , -SH 
and -OH. 

A further subject of the invention is therefore a method for the chemical synthesis of 
5 a compound of any of claims 1 to 13 using activated subunits, wherein said subunit 

contains at least one group of formula I. There are several approaches known for the 

chemical synthesis, such as the phosphotriester method of Narang et al., Meth. 

Enzymol. 68, 90-99 (1979); the phosphodiester method of Brown et al., Meth. Enzymol. 

68, 109-151 (1979); the diethylphosphoramidite method of Beaucage et al., Tetrahedron 
10 Lett. 22, 1859-1862 (1981); and the solid support method described in the U.S. Patent 

Specification No. 4,458,066 and in Methods in Molecular Biology, Ed. S. Agrawal, Vol. 

20, Humana Press, Totowa, NJ, 1993. The most preferred method of chemical synthesis 

uses the phosphoramidite approach. A particularly preferred method uses a activated 

subunit one or more compounds of general formula VII. This method has the 
15 advantage that it is very convenient and the reagents necessary, for example a 

phosphoramidite containing a group of formula I, is possible to be included easily. 

A further subject of the invention are therefore compounds of the general formula VII 



Formula VII 




. B 
M R 14 

^/NR32R17 

20 

wherein 

A is selected from the group consisting of O, S and N-(Ci-C 6 )-alkyl, 
M and M" are independendy selected from the group consisting of oxy, 
25 sulfanediyl, -NR 22 , -(Ci-Qo)-alkyl, or -O-(Ci-Ci 0 )-alkyl-O-, and -S- 

(Q-Co)-alkyl-O- and -NR^-tQ-QJ-alkyl-O-, 
R 22 is selected from the group of -H and -(Ci-Cio)-alkyl, 
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R w is selected from the group consisting of -H, -OR 31 > -(Ci-Qo)-alkoxy> 

-(C 2 "Cio)-alkenyloxy, -(C 2 -C 10 )-alkynyloxy > -halogen, -azido NHR 31 , SR 31 
and -NH 2 , 

R 31 is a protecting group or a reporter group, 

R 32 and R 17 are independently selected from the group consisting of -H, -(Ci-Qo)- 

alkyl, -(C 2 -Cio)-alkenyl and -(C 6 -C 22 )-aryi, 
R 18 is selected from the group consisting of substituted or unsubstituted -(Q- 

C 6 )-alkyl > unsubstituted -(Ci-C 6 )-alkoxy or -(Ci-C 6 )-alkoxy substituted one 

or more times by a group selected from the group consisting of -halogen, 

p-nitroaryloxy and -cyano, and 
B is a group of formula L 



Preferred compounds of formula VII are those wherein the group of formula I is 
not 2-aza-hypoxanthine. In a preferred embodiment, the group of formula I in 
15 formula VII contains at least one reporter group- Most preferable, the group of 

formula I contains exactly one reporter group. 

Most preferred in such compounds, in -NR 5 R 6 at least one of R 5 and R 6 is a protecting 

group. 

20 Further subject of the invention are compounds of general formula IX 



Formula IX 



NR 32 R 17 

i 

R 



R 18 



R 31 / \ 



wherein 



25 



A is selected from the group consisting of O, S and N-CQ-C^-alkyl, 
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M and M' are independently selected from the group consisting of oxy, 

sulfanediyl, -NR 22 , -(Ci-Cio)-alkyi, or -O-(Q-C 10 )-alkyt-O-, and -S- 

(Q-C 10 )-alkyl-O- and -NR 22 -(Q-C6)-alkyl-0-, 
R 22 is selected from the group of -H and -(Ci-Cio)-alkyl, 
5 R 14 is selected from the group consisting of -H, -OR 31 , -(Q-Cio)-alkoxy, 

-(C 2 -Cio)-alkenyloxy, -(C 2 -C 10 )-alkynyloxy, -halogen, -azido NHR M , SR 31 

and -NH 2 , 

R 31 is a protecting group or a reporter group, 

R 32 and R 17 are independenuy selected from the group consisting of -H, -(Ci-Qo)- 
10 alkyl, -(C 2 -Cio)-alkenyl and -(Q-C 22 )-aryl, 

R 18 is selected from the group consisting of substituted or unsubstituted -(Ci - 
QO-alkyl, unsubstituted -(Ci-Q)-alkoxy or -(Ci-Q)-alkoxy substituted one 
or more times by a group selected from the group consisting of -halogen, 
p-nitroaryloxy and -cyano, and 
15 B is a group of formula I 



Formula I P 1 



N 

i 

N 




X 

w £ 



! I! :Y 



20 wherein 

W is selected independently from X, Y and Z from the group consisting of N 
and CR 2 , 

Z is selected from the group consisting of N and C with the proviso that 

25 

if Z is N, then 
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X independently from W and Y is selected from the group 
consisting of N and CR 3 , and 

Y independently from W and X is selected from the group 
consisting of N and CR 4 , 

5 and the bond between X and Y is a double bond and the bond 

between Y and Z is a single bond, and 

ifZisC,then 

10 X isNR 33 ,and 

Y is selected from the group consisting of N and CR 4 and 
the bond between Z and Y is a double bond and the bond between X 

and Y is a single bond, 

15 R 1 , R 2 , R 3 and R 4 are independently selected from the group consisting of -H, - 

halogen, -OR 13 , -SR 19 , -(Ci-Cio)-alkyl, -(C 2 -Ci 0 )-alkenyl, -(C 2 -Ci 0 )-alkynyl, 
-N0 2 , -NR 5 R 6 , -cyano, and -C(=0)R n , 
R u is selected from the group consisting of -OH, -(Ci-QValkoxy, -(Q-C22)- 
aryioxy, and NHR 12 , 

20 R , R°, R", R", R" and R" are selected independently from the group consisting 

of -H, -(Ci-C l0 )-alkyl, -(C 2 -Ci 0 )-alkenyi, -(C 2 -C 10 )-alkinyl, -(Q-Q^-aryl, a 
protecting group and a reporter group, 
r and s are independently of each other an integer of 1 to 18, 
D is the position of attachment of the group to the rest of the nucleic acid 
25 binding compound, and 

alkyi, alkenyl and alkynyl being unsubstituted or substituted by one or more 
moieties selected from the group consisting of -halogen, -S-(Ci-C 6 )-alkyl, -(Q- 
QO-alkoxy, -NR^ 6 , -CO-R 11 , -NH-CO-NR^ 6 , -NH-CSNR^ 6 - and -[O- 
(CH 2 ) r ] s -NR 5 R 6 , 

30 with the proviso that at least one of R 5 and R 6 of -NR^ 6 is a protecting group. 

Those compounds can be used like those of formula VII in chemical synthesis. 
A further subject of the invention are compounds of general formula X 

35 
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Formula X 



M' 



M R 14 



H- 



wherein 



M and M 1 are independently selected from the group consisting of oxy, 
sulfanediyl, -NR 22 , -(C,-C 10 )-alkyl, or -O-(Ci-C 10 )-alkyl-O-, and -S- 
(Q-CoJ-alkyl-O- and -NR^Q-QO-alkyl-O-, 
is selected from the group of -H and -(Q-Cio)-alkyl, 
is selected from the group consisting of -H, -OR 31 , -(Q-Cio)-alkoxy, 
-(C 2 -Cio)-alkenyloxy, -(C 2 -C 1 o)-alkynyloxy, -halogen, -azido NHR 31 , SR 31 
and -NH 2 , 

is a protecting group or a reporter group, 
is a group of formula I 



R 22 
R 14 



R 
B 



31 



Formula I 



R 1 



N 

i 

N 




W 



wherein 



W is selected independently from X, Y and Z from the group consisting of N 
and CR 2 , 
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Z is selected from the group consisting of N and C with the proviso that 



ifZisN, then 



10 



X independently from W and Y is selected from the group 

consisting of N and CR 3 , and 
Y independently from W and X is selected from the group 

consisting of N and CR 4 , 
and the bond between X and Y is a double bond and the bond 
between Y and Z is a single bond, and 



ifZis C, then 



15 



X is NR 33 , and 

Y is selected from the group consisting of N and CR 4 and 

the bond between Z and Y is a double bond and the bond between X 
and Y is a single bond, 



20 



25 



30 



R 1 , R 2 , R 3 and R 4 are independently selected from the group consisting of -H, - 

halogen, -OR 13 , -SR 19 , -(Q-Cio)-alkyl, -(Q-QcO-alkenyl, -(Q-QoJ-alkynyl, 

-N0 2 , -NR 5 R 6 , -cyano, and -C(=0)R u , 
R 11 is selected from the group consisting of -OH, -(Ci-C6)-alkoxy, -(Q-C22)- 

aryloxy, and NHR 12 , 
R 5 , R 6 , R 12 , R 13 , R 19 and R 33 are selected independently from the group consisting 

of -H, -(Q-Cio)-alkyl, -(Q-Cio)-alkenyl, -(C2-C 10 )-alkinyl, -(Q-C^-aryi, a 

protecting group and a reporter group, 
r and s are independently of each other an integer of 1 to 18, 
D is the position of attachment of the group to the rest of the nucleic acid 

binding compound, and 
alkyl, alkenyl and alkynyi being unsubstituted or substituted by one or more 
moieties selected from the group consisting of -halogen, -S-(Ci -C$) -alkyl, -(Q- 
QO-alkoxy, -NR 5 R 6 , -CO-R 11 , -NH-CO-NR^ 6 , -NH-CSNR^ 6 - and -[O- 
(WJs-NR^ 6 . 



35 Those compounds are also useful in chemical synthesis, 
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In another option which is more suited for long oligomers and those based on natural 
backbones, the oligomers are produced enzymatically. In this case, a starting oligomer 
is reacted with a polymerase and a triphosphate or modified triphosphate such that a 
monophoshate or a modified monophosphate is attached to a terminus of the oligomer, 
thus elongating the oligomer. Also for this method, the man skilled in the art will know 
several possible formates, like the nick- translation approach, or the simple primer 
extension ( J. Sambrook. E.F. Fritsch, T. Maniatis, Molecular Cloning - A laboratory 
Manual, Cold Spring Harbor Laboratory Press 1989). 

For example, the incorporation of z 2 Ad into a DNA sequence can be performed via 
conventional methods, e.g. by polymerase-catalyzed incorporation of s^Ad-S'- 
triphosphate (11). 

15 A further subject of the invention is therefore a method for the enzymatic synthesis of a 
nucleic acid binding compound according to the invention comprising reacting a 
triphosphate subunit with a primer using a nucleic acid as a template for the elongation 
of the primer, wherein the triphosphate subunit contains a heterocyclic group of 
formula L Preferably, the triphosphate subunit has the formula VI. 



20 



25 



A further subject of the present invention are therefore compounds of the general 
formula VI 



Formula VI 




wherein 

A is selected from the group consisting of O, S and N-CCi-QO-alkyi, 
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R u is selected from the group consisting of -H, -OH, -(Ci-do)-alkoxy, O- 
protecting group, S-protecting group, NH-protecting group, -(C 2 -Qo)- 
alkenyloxy, -halogen, -azido, -SH, -(Q-Q)-alkylmercapto and -NH2 , 

R 15 and R 16 are independently selected from the group consisting of -H, -(Q-C 8 )- 
alkyl, -(C2-Ci 8 )-alkenyl, -(C 2 -Ci 8 )-aIkynyi, -(C2-Ci 8 )-alkyl-carbonyl, 
-(C3-Ci9)-alkenyl-carbonyl, -(C 3 -C l 9)-alkynyl-carbonyl, -(C6-Cu)-aryl- 
(Ci-C 8 )-alkyl, a protecting group or a compound of formula IV 



Formula IV 



U - P - R 29 
T 



wherein 



10 



15 



20 



25 



>23 



R 



25 



30 



T is selected from the group consisting of oxo, thioxo and selenoxo, 
U is selected from the group consisting of -OH, -SH, -SeH, -(Q-Cio)-alkoxy, 
-(C,-C 10 )-alkyl, -(Q-C^-aryl, -(Q-C, 4 )-aryl-(Ci-Cio)-alkyl, -NR^R 24 , and 
-0-(Ci-Ci 0 )-alkyl-0-(Ci-Cio)-alkyl-R 25 > or wherein NR^R 24 can together 
with N be a 5-6-membered heterocyclic ring, 

and R 24 are independently selected from the group consisting of -(C1-Q0)- 
alkyi, -(Q-C 2 o)-aryl, -(C6-Cu)-aryi-(Ci-Cio)-alkyl, -(Q-C^-alkyl- 
[^(CHzJdd-NR^R 27 , 

is selected from the group consisting of -H, -OH, -halogen, amino, 
-(Q-Ci 8 )-alkylamino, -COOH, -CONH 2 and COO(C 1 -C 4 )-alkyl, 
R 26 and R 27 are independently selected from the group consisting from -H, 
-(Ci-QO-alkyl, and -(Ci-C 4 )-alkoxy-(Q-C6)-alk5d, 
is selected from the group consisting of -OR 30 and -SR 30 , 
is selected from the group consisting of -H, -(Ci-C^-allcyl, -(C2-C10)- 
alkenyl, -(C6-C22)-aryl, a protecting group, a diphosphate and a reporter 
group, and 

M and M 1 are independently selected from the group consisting of oxy, 

sulfanediyl, -NR 22 , -(Ci-Cio)-alkyi, or -O-(Ci-Ci 0 )-alkyl-O-, and -S- 
(Ci-Cio)-alkyl-O- and -NR 22 -(Q-Q)-alkyl-0-, 

R 22 is selected from the group of -H and -(Ci-Qo)-alkyl, and 



R 29 

R 30 
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B is a moiety of formula I, 

wherein any alkyl, alkenyl and alkynyl can be substituted or unsubstituted, and 
wherein at least one of R 15 and R 16 is not a group of formula IV with the proviso 
that 

MR 16 , MR 15 and R 14 are not each -OH if R 1 is -NH 2 and if either 
W and X and Y and Z is N, or 

- WandXandZisNandYisCR 4 ,or 

- WandYandZisNandXisCR 3 . 

Most preferred in these compounds -MR 16 is a triphosphate group and -MR 15 is OH. 
The most preferred compound is the one in which R 14 is -OH. 



Those compounds are especially the compounds, wherein the heterocyclic moiety of the 
1 5 invention is contained not at the terminal position of the nucleic acid binding 
compound. 

Preferred compounds are those, wherein M is oxy or sulfanediyl, R 16 is a compound of 
formula IV wherein U is -OH, T is oxo or thioxo, R 29 is -OR 30 and R 30 is a diphosphate 
20 group and the salts thereof. 

Most preferred compounds are of formula VIII 



Formula VIII 



PPP-O 




OH R 14 



25 



wherein 



PPP is a triphosphate group, 

R 14 is selected from the group consisting of -H, -OH, -(C 1 -Ci 0 )-alkoxy, 

- ( Q-Cio) -alkenyloxy, -(Q-Cio)-alkynyloxy halogen, -azido and NH 2 , and 
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B is a group of formula I> 

with the proviso that R 14 is not OH if B is 2-azaadenine. 

In another preferred embodiment, the group of formula I in compounds of formula VII 
is selected from the group consisting of groups of formula I, wherein either 

W is N, Z is N, Y is N and X is CR 3 , or 
WisN,ZisC,YisNandXisCR 3 ,or 
WisN,ZisN,YisNandXisN. 

A further subject of the invention is a method for the enzymatic synthesis of a nucleic 
acid binding compound of the invention comprising reacting a triphosphate subunit 
with a primer using a nucleic acid template for the elongation of the primer, wherein 
the triphosphate subunit contains a heterocyclic group of formula I. 

More preferable, this method uses as a triphosphate subunit a compound of formula 
VIII as defined above. 



The compounds of general formulae VI, VII and VIII can be prepared from compounds 
20 readily available. In a first embodiment, the compounds of formula VI characterized by 
the fact that the atom in the 2-position (purine numbering) are prepared by chosing the 
corresponding compound having a carbon atom in this position (preferably wherein 
MR 15 is OH and M'R 16 is OH and R 1 is NH 2 , reacting it with 2-chloroacetaldehyde, 
cleaving the pyrimidine ring using alkaline conditions and again closing the ring using 
25 sodium nitrite and thereafter hydrolyzing the imidazole ring introduced by the 
aldehyde. This reaction yields a replacement of the carbon atom of the 2-position 
by a nitrogen atom. 

This compound can then be reacted with reagents to attach protecting groups or/and 
30 activating groups, like phosphoramidites or phosphates, to the hydroxyl groups 

intended to be used in oligomerization. Methods for attaching protecting groups, like 
the dimethoxytrityl group are generally known to a man skilled in the art. Also the 
attachment of mono-, di- and triphosphate groups are known in the art. 



35 In a preferred embodiment, the method for the preparation of a compound of formula 
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VII, wherein R 1 is a protected amino group, M'R 31 is O-protecting group and 
MPR I8 NR 32 R 17 is O-phosphoramidite group, a compound of formula VI, wherein M'R 16 
and MR 15 are each OH and R 1 is NH 2 is subjected to conditions to protect the amino 
group, preferably by the protecting group dialkylaminomethylidene, then reacted with 
5 reagents to protect the S'-hydroxyi group and then reacted with an activated phosphane 
to produce the phosphoramidite. The resulting compound of formula VII can be used 
in the chemical synthesis for the introduction of the 2-aza-purine directly. The 
protecting group at R 1 will be removed during the chemical synthesis. 

10 For an alternative synthesis of 2-aza-2'-depxyadenosine see N. Yamaji, M. Kato, 
Chemistry Lett. 1975,311-314). 

By the above methods, it is principally possible to introduce only one monomer 
containing the moiety of the invention, but also more that one, as the case may be. 
1 5 The highest reduction of T m will occur, if all C in the binding region are replaced. This - 
allows fine tuning of the Tm. Of course there may also be remaining Cs in any regions 
that are not intended to base pair with the nucleic acid to be determined* 

Compounds of formula I bis VIII, in which any of substituents R 1 , R 2 , R 3 and R 4 are 
20 selected from the group consisting of halogen, especially CI, cyano or SR 19 are useful as 
intermediate products for the synthesis of compounds wherein those substituents are 
selected from the group of -NR 5 R 6 and -OR 13 , preferably -NR 5 R 6 . The intermediate 
products can be converted into the final products by substitution reaction. 

25 A further subject of the invention is a method for the determination of a nucleic add 
comprising the steps 

providing a sample suspected to contain said nucleic acid, 
providing a nudeic acid binding compound of daim 1, which is essentially 
30 complementary to a part or all of said nuddc add, 

contacting said sample with said nucleic add binding compound under 
conditions for binding said nucleic add binding compound to said nudeic add, 
determining the binding product formed from said nuddc add and said nudeic 
add binding compound as a measure of the presence of said nuddc add. 

35 



1 
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Methods for the determination of nucleic acids by hybridization are generally known in 
the art, for example from Sambrook et al. (cited above). They can easily be adopted to 
the use of the probes of the present invention. Preferably, the nucleic acid binding 
compound will be bound to the nucleic acid in solution, as the reaction is faster than on 
5 a solid phase. It is apparent to a man skilled in the art how to determine the Tm of the 
hybrid of the nucleic acid to be determined and the probe prior to the construction of 
an assay and its general outset. If determined once, it should be clear to the man that he 
should choose similar conditions in each assay using the same analyte nucleic acid. 

10 Such determination will start with providing a sample which is suspected to contain the 
nucleic acid to be determined. The sample may have been subject to steps for bringing 
the nucleic acids in an appropriate form, for example by lysing any cells, in which the 
nucleic acids may be contained and would otherwise not be possible to be determined. 
Further, any steps to provide the sample could contain steps to purify the nucleic acids 

15 to be determined from components of an original sample that could affect the 

determination. Such components could be enzymes that would, if a nucleic acid is set 
free from cells, would digest or degrade the nucleic acid, for example RNases. It is 
further preferred to also remove from the nucleic acids components of the original 
sample that could affect amplification of the nucleic acids. 

20 

There are several possibilities for methods for the determination of nucleic acids using 
the nucleic acid binding compound of the invention. In a first group, the nucleic acid 
binding compound is used as a detectable probe. In this case, the nucleic acid binding 
compound will contain a detectable reporter group. Any hybrids formed from the 

25 nucleic acid binding compound and a nucleic acid can then be determined via the 

detectable reporter group. This group of assays can further be devided into two groups, 
one being the group of homogeneous assays and the other being the heterogeneous 
assays. In heterogeneous assays, preferably the hybrid (binding product) will be 
determined when bound to a solid phase. This embodiment has the advantage that any 

30 excess of probe and other components can be removed easily from the hybrid, thus 
make the determination easier. The hybrid formed can be captured to a solid phase 
either covalently, noncovalently, specifically or unspecifically. There are several 
embodiments which are known to a man skilled in the art. 



35 In the so-called homogeneous assays, the hybrid formed will not be bound to a solid 
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phase, but will be determined either directly or indirectly in solution. A preferred 
example of such assays is disclosed in PCT/US 91/05571 which is incorporated by 
reference here. The nudeic acid binding compound of the invention is especially useful 
as a probe in this assay. As the assay disclosed therein may need fine tuning of melting 
temperatures of the primers and probes used in these assays, the present invention using 
the modulation of a melting temperature of the hybrid formed by the nucleic acid to be 
determined or the amplificates thereof and the probe is especially useful. This is 
especially useful since selectivity could be preserved, as the Tm is reduced by choosing 
a(n-x)-mer oligonucleotide instead of an n-mer oligonucleotide. 

Therefore, a further subject of the invention is a method for the determination of the 
presence, absence or amount of a nucleic acid in a sample comprising the steps: 



providing primers, a first primer being essentially complementary to a first 
15 binding sequence of said nucleic acid, and the second primer being essentially 

complementary to a binding sequence of a complement of this nucleic acid, and a 
probe being complementary to the nucleic acid or the complement thereof 
between the binding sequences of said primers, said probe being labelled at 
different subunits by at least two different reporter groups, 
20 - subjecting the sample with said primers and said probe under conditions 

favouring extention of said primers and separating said reporter groups from each 
other by disintegrating the probe, and 

determining the extent of disintegration of the probe via at least one of said 
reporter groups, 

25 

wherein at least one of said primer or/and probe are a nucleic acid binding compound 
as defined above. Preferably, the melting point of the primers and the probe will be 
selected such that the T m f s are similar, but that the T m of the probe is higher than those 
of the primers. 

30 

In another embodiment, the nucleic acid binding compound of the invention can be 
used as a immobilizable or immobilized probe for binding any nucleic acids to a solid 
phase. Modes to immobilize the compound are disclosed above. In an assay, it is either 
possible to immobilize the compound before contacting it with the sample or during 
35 contacting the sample with a solid phase, or even after contacting the sample with the 



Prtnted:1 9-01 -2001 



• 



mm mwmi 



m 



33 



ID ESC 



solid phase. In each of these cases, the nucleic acid to be determined will be bound to 
the solid phase and preferably, any substituents of the sample not to be determined may 
be washed away from the solid phase, while the compounds and the nucleic acid will 
remain bound. Thereafter, the hybrid bound to the solid phase can be determined by 
known methods, for example by using detectably labelled probe as outlined above or by 
direct determination of the hybrid, for example by contacting the hybrid with the 
intercalating dyes and measuring the change on the solid phase. 



10 



In another embodiment, the immobilized probe is used to isolate or purify a nucleic 
acid. 



15 



Another preferred embodiment uses a nucleic acid binding compound which is both 
bound to a solid phase and labelled by a detectable reporter group. The label in this case 
preferably is a group the detectable properties of which will change when the nucleic 
acid to be determined will bind to the probe. Again, those compounds are known in the 
art. 



20 



25 



The man skilled in the art will be in the position to design the sequence of a nucleic acid 
binding compound when knowing the sequence of the nucleic acid to which the nucleic 
acid binding compound is intended to be bound. In almost all cases, the nucleic acid 
will have a sequence containing all four natural nucleic bases. In this case, it is 
preferable to choose all bases that when bound to the particulars stretch of the nucleic 
acid are located at positions capable of base pairing to C, A and T of the nucleic acid to 
be G, T and A respectively. However, these bases can be replaced by equivalent bases 
base pairing to the mentioned bases in the nucleic acid. The base being at the position 
capable of base pairing to G in the nucleic acid, will now be replaced one or more time 
in the nucleic acid binding compound by a group of I. As outlined in this invention, the 
group of I can also base pair to other moieties, for example to force the orientation of 
binding from the antiparallel (regular) mode to the parallel (non-natural) orientation. 

A further subject of the present invention is the use of 2-azapurine in a nucleic acid 
binding compound as a substitute for cytosine, especially for the binding of nucleic 
acids the bases of which are not only consisting of G. 



35 A further subject of the present invention is the use of 2-azapurine in hybridization 



30 
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reactions of probes with the nucleic acid as a base at the position of the probe base 
pairing at G in the nucleic acid. 



10 



15 



20 



25 



Further subject of the present invention is a binding product of at least one nucleic acid 
binding compound of the invention and a nucleic acid, the nucleic acid binding 
compound and the nucleic acid being bound to each other by base pairing in parallel or 
antiparallel orientation. The binding product can contain one molecule of the nucleic 
acid and one molecule of the nucleic acid binding compound, which form a duplex, or 
the binding product can contain three strands, thus being a triplex. The triplex can 
contain either two molecules of the nucleic acid binding compound and one strand of 
the nucleic acid, or can contain one nucleic acid binding compound and two molecules 
of the nucleic acid Which kind of triplex is formed, is dependent upon the 
concentration and the complementarity of the nucleic acid binding compound and the 
nucleic acid. 

The present invention provides for the possibility to choose similar Tms for a number 
of probes with different sequence. Thus, a particular subject of the invention are 
methods, wherein nucleic acids of different sequences are to be isolated or determined 
simultaneously. 

In a first embodiment, this method is a so-called multiplex method for the isolation or 
determination of nucleic acids. In this embodiment, probes each containing a sequence 
complementary to a sequence of one of the nucleic acids (for example, nucleic acids 
from different viruses, like HCV, HIV and HBV) having a T m adapted according to the 
invention (one or more probes containing a group of formula I) are contacted with the 
sample suspected to contain said nucleic acids. Depending upon the formate, the 
different nucleic acids can be determined via the hybrids formed with the probes either 
as a sum or independently. 



30 



A second embodiment is based on arrays of probes. Especially, a subject of the 
invention is a method for the determination of the presence or absence of nucleic acids 
each comprising a particular sequence in a sample comprising the steps 
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contacting said sample with a solid phase having immobilized on its surface 
nucleic acid binding compounds each containing a sequence complementary to 
one of said particular sequences of said nucleic acids, 

5 - determining on said solid phase the formation of hybrids containing a nucleic 
acid with a particular sequence and the nucleic acid binding compound 
containing the complementary sequence 

characterized in that said at least one of said nucleic acid binding compounds is a 
10 compound comprising a backbone, said backbone having attached heterocyclic 

groups capable of base pairing to natural nucleobases at least one of said 
heterocyclic groups being one of the naturally occurring nucleobases 
characterized in that at least one other of said heterocyclic groups is a group of the 
general formula I. 

15 

Methods of this type are either useful when simultaneously testing a sample for the 
presence or absence of nucleic acids, for example a panel of different bacterial species. 
In this way, it is not only possible to test for one species after each other batchwise, but 
to receive sequence information on different nucleic acids simultaneously. Those types 

20 of methods are generally known in the art The present invention, however, has found 
that the nucleic acid binding compounds of the invention are particularly useful in such 
kind of assays, as the melting temperature can be adjusted to be very similar for nucleic 
acid binding compounds of difference sequence or/and length. It is evident that in order 
to achieve similar melting temperatures of each of the nucleic acid binding compounds, 

25 not all of these different compounds need to be modified according to the invention, 
but the man skilled in the art will modify just those compounds which do not behave 
like the others, for example by having a much higher melting temperature than the 
melting temperatures of the other compounds on the same solid phase. With each C 
replaced according to the invention, the T m may decrease by between 3 and 6 °C, 

30 preferably by between 4 and 5 °C. In this respect, the man skilled in the art will, 

compared to the chip technology presently known, has much more flexibility to select 
the particular sequences used for binding the different nucleic acids. For example, the 
high G-C content of such regions in nucleic acids to be determined up to now 
prevented these regions from being useful as target regions for such assays based on 

35 probes complementary to these regions. According to the present invention, even these 
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regions, which may be highly specific to the nucleic acid to be determined, can now be 
used in such assays. 

Assays using array technology are generally known to a man skilled in the art, for 

5 example, from EP-A-0 476 014. Those solid phases are preferably flat carriers. On their 
surface, separated by pure surface, arrays, each of the arrays having bound a probe of 
different sequence directed to a specific particular sequence in a nucleic acid. 
Depending upon the needs of the actual assay, the sequence of the probe will be 
essentially complementary to one (or more) of the particular sequences contained in 

10 said nucleic acid to be determined. During the assay, each nucleic acid will find the 
probe on the surface to which it can bind. The determination of the formation of 
hybrids in each array on the solid phase allows the determination of the presence or 
absence of nucleic acids containing the particular sequence based on the change of a 
property in this specific array. Preferably, the evaluation of these changes will be made 

1 5 assisted by a computer programme, which knows in which array which particular 
sequence is present. One chip can contain from 2 to thousands, or even millions of 
arrays, each having bound a nucleic acid binding compound having a specific sequence 
which may be unique. However, it is even possible to determine the presence of a group 
of different nucleic acids, each having a sequence in common, just by using a nucleic 

20 acid binding compound being either relatively unspecific and thus binding different 
sequences, or by selecting the sequence of the nucleic acid binding compounds such 
that it is directed to a sequence which is present on different nudeic acids. 



In a second, slightly different approach, such arrays can be used to determine the 
25 sequence of a nucleic acid by following the so-called "sequencing by hybridization" 
approach. In this mode, the sample preferably will contain mostly nucleic acids of the 
same sequence. This can be achieved by isolating the sequence from a mixture in which 
they were contained or by enriching them in-vitro or in-vivo amplification. The 
sequence of the nucleic acid binding compounds bound to the solid phase will be 
30 selected such that they altogether contain a sequence covering the sequence to be 

determined in the nucleic acid. Within this sequence, the sequences of each nucleic acid 
binding compound will overlap by one or more bases. In the determination step of this 
method, it will be detected to which of these partial sequences the nucleic acid will bind, 
which will only occur (ideally), if a sequence complementary to the partial sequence is 
35 contained on a nucleic acid to be determined. Preferably, the evaluation of the result of 
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the assay will be made by a computer. The computer will thus determine from the 
partial sequences that are apparently contained in the sequence to be determined, in 
which series they must have been arranged in the whole nucleic acid. Especially in this 
kind of assays, the present invention is very helpful, as this approach needs a high 
number of nucleic acid binding compounds to be fixed on the surface, which can not 
always take into account the problems occuring with high G-C content. In the present 
invention, it is therefore possible to sequence even nucleic acids having sequences of low 
and of high G-C content simultaneously. 



10 It is apparent to a man skilled in the art that any compound as disclosed herein can to 
some extent be present as tautomers and salts. These tautomers and salts, preferably the 
alkali salts, most preferred the sodium salts, are within the definition of the formulae 
and subject to the present invention. 
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The present invention is explained in more detail by the following examples: 
Examples 



5 General 



10 



15 



20 



25 



30 



35 
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Monomers: Hash chromatography (FC): at 0.5 bar with silica gel 60 (Merck, Darmstadt, 
Germany). Solvent systems for FC and TLC: CH 2 Cl 2 -MeOH 85:15 (A), EtOAc-MeOH 
3:1 (B), CH 2 Cl 2 -MeOH 80:20 (C), CH 2 Cl 2 -HOAc-MeOH 17:1:3 (D), CH 2 Cl 2 -MeOH 
9:1 (E), CH 2 Cl 2 -acetone 85:15 (F). Samples were collected with an UltroRac II fractions 
collector (LKB Instruments, Sweden). Melting points: Biichi SMP-20 apparatus (Btichi, 
Switzerland). UV spectra: U-3200 spectrophotometer (Hitachi, Japan). NMR spectra: 
AC-250 and AMX-500 spectrometers (Broker, Germany); 8 values are relative to 
internal M^Si or external H3PO4. Huorescence spectra were recorded in H 2 0 on a 
F-4500 fluorescence spectrophotometer (Hitachi, Japan). Microanalyses were 
performed by Mikroanalytisches Laboratorium Beller (Gottingen, Germany). 

Oligonucleotides: Oligonucleotides were synthesized with a ABI 392 DNA synthesizer 
(Applied Biosystems, Germany) according to the standard protocol using the "trityl-off * 
mode, except for the unmodified oligodeoxynucleotides which were synthesized using 
the "trityl-on" mode. The coupling yields of modified phosphoramidites were generally 
95% on average (trityl conductivity monitoring). The detritylated modified oligomers 
were purified by ion-exchange chromatography on a Dionex Nudeopac PA- 100 HPLC 
column (4 x 250 mm, P/N 043010, Dionex GmbH, Idstein, Germany) using the 
following gradient: 5 min 5% 0.01 M NaOH/1.5 M aq. Lid (X) in 0.01 M NaOH (Y); 25 
mm5-30%YmX;10nim30^YmX;5mm5% ' 
apparatus: L-4250 UV/VIS detector, L-6250 Intelligent pump and D-2500 integrator 
(Merck-Hitachi, Germany). The tritylated unmodified oligonucleotides were purified 
by RP-18 HPLC using the following apparatus and procedure: 250 x 4 mm RP-18 
column (Merck, Germany); Merck-Hitachi HPLC apparatus consisting of a 655 A-12 
liquid chromatograph with a 655 A variable wavelength UV monitor and a D-2000 
Chromato-Integrator (Merck-Hitachi, Darmstadt, Germany); gradients of O.l M 
(Et 3 NH)OAc (pH 7.0)/MeCN 95:5 (U) and MeCN (V); gradient I: 0 - 50 min 0 - 50% V 
in U, flow rate 1 mL/min; gradient II: 0 - 20 min 0 - 20% V in U; 20 - 40 min 20 - 40% V 
in U, flow rate 1 mL/min. Detritylation was performed by treating the purified 



39 

oligomers with a 2.5% dichloroacetic acid solution in CH 2 C1 2 (1 mL) for 5 min. After 
neutralization with Et 3 N, evaporation to dryness, followed by co-evaporation with 
MeOH, the oligomers were again purified by RP-18 HPLC using the above-mentioned 
device. Gradient: 0 - 30 min 0 - 20 % V in U, 30 - 35 min 20% V in U, 35 - 40 min 20 - 
5 0% V in U, 40 - 45 min 0% V in U. Subsequent desalting for all oligonucleotides was 
performed on an RP-19 HPLC column (4 x 100 mm) using the apparatus as described 
above. Solvent for adsorption: H 2 0, solvent for desorption: MeOH-H 2 0 3:2. General 
flow rate: 1 mL/min. MALDI-TOF Mass spectra of the oligonucleotides were measured 
on a home-built apparatus using UV laser irradiation at 337 nm for 3 nsec. 

10 

The enzymatic hydrolysis of the oligomers was performed as described in Helv. Chim. 
Acta 1998, 81, 1 139-1 155, but using a flow rate of 0.6 mL/min. Quantification of the 
constituents was made on the basis of the peak areas, which were divided by the 
extinction coefficients of the nucleoside (e^ values: dA 15400, dC 7300, dG 11400, dT 
15 8800, z 2 Ad 8200). Snake venom phosphodiesterase EC 3.1.15.1, Crotallus durissus) and 
alkaline phosphatase (EC 3.1.3.1, E. coli) used for the enzymatic hydrolysis of 
oligonucleotides were from Roche Diagnostics GmbH. 

Determination of melting curves and thermodynamics: Absorbance vs. temperature 
20 profiles were measured on Cary 1 or IE spectrophotometers (Varian, Australia) with a 
Cary thermoelectrical controller. The T m values were measured in the reference cell with 
a Pt-100 resistor, and the thermodynamic data (AH°, AS°, AG 0 2 9s) were calculated with 
the program MeltWin 3.0. Circular dichroism (CD) spectra were recorded on a Jasco 
600 (Jasco, Japan) spectxopolarimeter, a thermostatically controlled bath (Lauda RCS-6) 
25 in a 1-cm cuvette. 

Example 1 

3- (2-Deoxy-6-D-erythro-pentofuranosyl)-3H-imidazo [2, 1 -i] purine (ljtfVEtheno-Z 1 - 
30 deoxyadenosine, 5). 2'-Deoxyadenosine monohydrate (1) (5.0 g, 20 mmol) was 

dissolved in 1M aq. sodium acetate buffer (pH 4.5 - 5.0, 110 mL) by warming to 40 - 
50°C. To the solution chloroacetaldehyde (50% aq. soln, 7.7 mol/L, 25 mL) was added, 
and the reaction mixture was stirred for 70 h at room temperature. The yellow solution 
was evaporated to dryness, and the residue was dissolved in MeOH and filtered to 
35 remove inorganic salt. After washing with MeOH the combined filtrate and washings 
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were concentrated in vacuo at 40 - 50°C. The residue was applied to FC (silica gel 60H, 
column: 20 x 6 cm). Elution with CH 2 Cl 2 -MeOH (85:15) gave a main fraction from 
which upon evaporation of the solvent and subsequent crystallization from MeOH- 
EtOAc compd. 5 (3.86 g, 70%) was isolated as colorless crystals. M.p. 138-141°C. TLC 
(silica gel, EtOAc-MeOH, 3:1): R f 0.4. UV (MeOH): 275 (7300), 265 (7600), 258 
(6600), 229 nm (35700). 'H-NMR QD 6 ]DMSO) 8 2.39 (m, 1H, H a -C(2')); 2.70 (m, 1H, 
H 6 -C(2*)); 3.57 (m, 1H, H a -C(5')); 3.67 (m, 1H, H b -C(5')); 3.88 (m, 1H,H-C(4'); 4.43 
(m, 1H, H-C(3')); 4.99 (t, 1H, 3 J(H,H) = 5.2 Hz, 5'-OH); 5.38 (d, 1H, 3 J(H,H) = 3.8 Hz, 
3'-OH); 6.47 (pt, 1H, 3 J(H,H) = 6.2 Hz, H-C(l')); 7.55 (s, 1H, H-C(ll); 8.07 (s, 1H, H- 
C(10); 8.53 (s, 1H, H-C(2)); 9.29 (s, 1H, H-C(8)). 



Example 2 



15 



20 



25 



l-tt-Deoxy-^D-eiythro-pentofur^^ 

Compound 5 (3.85 g, 14 mmol) was treated with IN aq. NaOH (60 mfc) at room 
temperature overnight. The reaction mixture was adjusted to pH 7 by addition of 2N 
aq. HC1 and concentrated to a syrup. This was dissolved in absolute MeOH, and the 
precipitated NaCl was filtered of and washed with MeOH. Filtrate and washings were 
combined and evaporated. The residue was applied to FC (silica gel 60H, column: 20 x 6 
cm). Elution with CH 2 Cl 2 -MeOH (C) afforded a main zone from which compound 6 
(2.70 g, 73%) was obtained as a colorless foam which was used for the next reactions 
without further purification. An analytical sample was crystallized from MeOH-EtOAc 
to give colorless spherical crystals; m.p. 91-93°C (decomp.). TLC (silica gel, CH 2 C1 2 - 
HOAc-MeOH, 17:1:3): R f 0.22. UV (MeOH): JU«271 nm (12800). ! H-NMR 
([D 6 ]DMSO) 6 2.21 (m, 1H, H a -C(2')); 2.47 (m, 1H, H g -C(2')); 3.57 (m, 2H, H 2 -C(5')); 
3.84 (m, 1H, H-C(4')); 4.36 (m, 1H, H-C(3')); 6.00 (pt, 1H, 3 J(H,H) = 6.5 Hz, H-C(l')); 
6.60 (br. s, NH 2 ); 7.13 (s, 2H, H-C(4) + H-C(5)) ; 7.55 (s, 1H, H-C(2)); 8.16 (s, NH). 
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Example 3 

3-(2-Deoxy-S-D-ei7thrc^ 

(l,N 6 -etheno-2-aza-2 f -deoxyadenosine, 7). A solution of compound 6 (4.50 g, 17 
5 mmol) in 80% aq. HOAc was treated with sodium nitrite (1.17 g, 17 mmol) in an ice- 
water bath for 1 h. The reaction mixture was evaporated to a syrup. This was dissolved 
in H2O and evaporated repeatedly to remove HOAc. The residue was applied to FC 
(silica gel 60H, column, 20 x 6 cm). Elution with CH 2 Cl 2 -MeOH (85:15) afforded 
compound 7 (2.50 g, 53%) upon evaporation. M.p. 151-152°C (decomp.). TLC (silica 
10 gel, CH 2 Cl 2 -MeOH, 4:1): R f 0.5. UV (MeQH): W 282 (3100), 268 (3200), 238 nm 

(37900). >H-NMR ([D 6 ]DMSO) 8 2.54 (m, 1H, H a -C(2')); 2.85 (m, 1H, H fi -C(2')); 3.97 
(m, 2H, H 2 -C(5')); 4.00 (m, 1H, H-C(4')); 4.50 (m, 1H, H-C(3')); 4.96 (t, 1H, 3 J(H,H) = 
5.4 Hz, 5'-OH); 5.41 (d, 1H, 3 J(H,H) = 4.3 Hz, 3'-OH); 6.69 (pt, 1H, 3 J(H,H) = 6.3 Hz, 
H-C(l')); 7.85 (d, 1H, 3 J(H,H) = 1.1 Hz, H-C(ll)); 8.75 (d, 1H, 3 J(H,H) = 1.1 Hz, H- 
15 C(10)); 8.95 (s, 1H, H-C(8)). Anal, calcd. for CuH 12 N 6 0 3 (276.25): C 47.83, H 4.38, N 
30.42; found: C 47.71, H 4.32, N 30.32. 

Example 4 

20 4-Amino-7-(2-deoxy-fr^-erythr^ 

(2-aza-2 T -deoxyadenosine, 2). Compound 7 (0.56 g, 2 mmol) was dissolved in 1M aq. 
sodium acetate buffer (pH 4.0 - 4.5, 120 mL) by warming to 40 - 50°C. To this solution 
N-bromosuccinimide (2.8 g, 16 mmol) was added, and the reaction mixture was stirred 
at room temperature overnight. The reaction mixture was evaporated and applied to a 

25 Dowex 1x8 ion exchange column (3 x 12 cm, OH" form). Elution with H 2 0 (250 mL) 

gave compound 2 (0.19 g, 38%) as colorless needles which decompose above 185°C. The 
reaction product was identical with an authentic sample in all respects 1211 . 

Examples 

30 

7-(2-deoxy-S-l>eiythro-pentofti^^ 

(2-aza-2 f -deoxyinosine, 3). Compound 2 (19 mg, 0.076 mmol) was dissolved in H 2 0 
and adenosine deaminase (2 jig, from calf intestine, dissolved in glycerole) was added. 
The reaction mixture was stirred for 18 h at room temperature until 2 had completely 
35 disappeared (UV monitoring) and then evaporated to dryness in a SpeedVac 
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concentrator. UV (H 2 0): 247 (5500), 290 nm (6200). J H-NMR (D 2 0): 2.49 (m, 

1H, H a -C(2')); 2.75 (m, 1H, H 6 -C(2')); 3.41, 3.50 (2m, 2H, H 2 -C(5')); 4.03 (m, 1H, H- 

C(4')); 4.51 (m, 1H, H-C(3')); 6.43 (pt, 1H, 3 J(H,H) = 3.2 Hz, H-C(l')); 8.31 (s, 1H H- 
C(8)). 

Example d 

4-(Benzoylanimo)-7-(2-deoxy-fi-D-eryto^ 

d] [l,2,3]-triazine (8). Compound 2 (125 mg, 0.5 mmol) was co-evaporated twice with 
anhydrous pyridine. The residue was suspended in anhydrous pyridine and treated with 
trimethylsilyl chloride (0.5 mL, 4 mmol). After few minutes of stirring a clear solution 
was formed. The reaction mixture was stirred at room temperature for 2 h. Next, 
benzoyl chloride (0.25 mL, 2 mmol) was added, and stirring was continued for another 
2 h. The reaction mixture was cooled in an ice-water bath, and H 2 0 (1 mL) was added. 
After 10 min the reaction mixture was treated with aqueous cone. NH 3 (0.8 mL) and left 
for additional 30 min. The mixture was then evaporated to dryness, treated with H 2 0, 
and extracted with EtOAc (3 X 20 mL). The combined extracts were dried (Na 2 S0 4 ) and 
applied onto a silica gel column (3 X 15 cm). Ehraon was performed with CH 2 C1 2 (150 
mL), followed by CH 2 Cl 2 -MeOH (9:1). The nucleoside-containing fractions were 
evaporated to dryness, and compound 8 was crystallised from MeOH-H 2 0 to give 
colorless needles (135 mg, 76%). M.p. 208-210° C (decomp > 170°C). TLC (silica gel, 
CH 2 Cl 2 -MeOH 9:1): R f 0.31. UV: (10% MeOH in water): 233 (15600), 276 nm 
(16400). 'H-NMR ([D 6 ] DMSO): 8 2.97 (2m, 2H, H 2 -C(2')); 3.69 (m, 2H, H 2 -C(5')); 
4.00 (m, 1H, H-C(4')); 4.57 (m, 1H, H-C(3')); 5.07 (t, 1H, 3 J(H,H) = 4.8 Hz, 5'-OH); 
25 5.49 (d, 1H, 3 J(H,H) = 4.0 Hz, 3'-OH); 6.72 (t, 1H, »J(HJQ = 6.5 Hz, H-C(l')); 7.61- 
7.78 (m, 4H, aromatic-H), 8.16 (d, 2H, aromatic-H); 9.09 (s, 1H, H-C(8)); 11.84 (s, 1H, 
N-H). Anal, calcd. for C 16 H 16 N 6 0 4 (356.3): C 52.93, H 4.41, N 23.38; found: C 52.68, H 
4.39, N 23.07. 



15 



20 
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Example 7 

7- (2-Deoxy-fi-D-erythro-pentofuranosyl)-4- { [ (dimethylamino) m ethylidene ] -amino }- 
7H-imidazo[4,5-d] [l,2,3]-triazine (9a). To a stirred suspension of compound 2 (63 mg, 

5 0.25 mmol) in MeOH (5 mL) N,N-dimethylformamide dimethyiacetal ( 120 mg, 0.5 
mmol) was added. Stirring was continued for 2 h at room temperature. The reaction 
mixture was evaporated to dryness, and the residue was adsorbed on silica gel. Flash 
chromatography on a silica gel column (3 X 10 cm) with CH2CI2 (100 mL) followed by 
CH 2 Cl 2 -MeOH (9:1) afforded colorless needles (MeOH-H 2 0, 65 mg, 85%). M.p. 173- 

10 175°C. TLC (silica gel, CH 2 Cl 2 -MeOH, 9:1): R f 0.28. UV: (10% MeOH in H z O): 

234 (13250), 319 nm (29500). *H-NMR ([D 6 ] DMSO): 5 2.84 (2m, 2H, H-C(2')); 3.17, 
3.25 (2s, 2H, N-CH 3 ); 3.60 (m, 2H, H 2 -C(5')); 3.92 (m, 1H, H-C(4')); 4.47 (m, 1H, H- 
C(3')); 5.05 (t, 1H, 3 J(H,H) = 4.9 Hz, 5'-OH); 5.39 (d, 1H, 3 J(H,H) = 4.0 Hz, 3'-OH); 
6.55 (t, 1H, 3 J(H,H) = 6.6 Hz, H-C(l')); 8.79 (s, 1H, N=CH); 9.08 (s, 1H, H-C(8)). 

15 Anal, calcd. for Ci 2 H, 7 N 7 03 (307.3): C 46.90, H 5.58, N 31.90; found: C 46.55, H 5.68, N 
31.66. 

Example 8 

20 7- (2-Deoxy-l^D-eiythro-pentofuranosyl)-4-{ [ (di-isobutylamino)methylidene] - 

amino}-7H-imidazo[4,5-d] [ 1,2,3] -triazine (9b). As described for 9a but using N,N-di- 
isobutylformamide dimethyiacetal. Colorless crystals (72%). M.p. 138-140° C. TLC 
(silica gel, CH 2 Cl 2 -MeOH, 9:1): R f 0.40. UV (10% MeOH in water): 236 (10100), 
325 nm (25850). *H-NMR ([D 6 ]DMSO): 8 0.88, 0.94 (2d, 12 H, CH 3 ); 1.95, 2.20 (2m, 

25 2H, CH); 2.80 (2m, 2H, H 2 -C(2')); 3.30-3.74 (m, 6H, H 2 -C(5') and 2 CH 2 ); 4.00 (m, 
1H, H-C(4')); 4.45 (m, 1H, H-C(3')); 5.03 (t, 1H, 3 J(HJ1) = 4.9 Hz, 5'-OH); 5.37 (d, 
1H, 3 J(H,H) = 4.0 Hz, 3'-OH); 6.54 (t, 1H, 3 J(H,H) = 6.4 Hz, H-C(l')); 8.78 (s, 1H, 
N=CH); 9.10 (s, 1H, H-C(8)). Anal, calcd. for Ci 8 H 29 N 7 03 • Vt H 2 0 (400.5): C 53.99, H 
7.55, N 24.48; found: C 53.65, H 7.62, N 24.11. 

30 
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Example 9 



7-(2-Deoxy-frJ>erythro-pentofo^ 

7H-imidazo[4,5-d] [l,2,3]-triazine (9c). As described for 9a but using N,N-di-n- 
5 butyiformamide dimethylacetal. Colorless needles (75%). M.p. 107-109° C. TLC (silica 
gel, CH 2 Cl 2 -MeOH, 9:1): Rf 0.42. UV: (10% MeOH in water): X roax 235 (10200), 325 nm 
(25700). 'H NMR ([D 6 ]DMSO): 5 0.93 (t, 6H, CH 3 ), 1.33, 1.64, 3.70 (3m, 12H, -CH 2 -); 
2.45, 2.80 (2m, 2H, H 2 -C(2')); 3.60 (m, 2H, H 2 -C(5')); 3.92 (m, 1H, H-C(4')); 4.48 (m, 
1H, H C(3')); 5.04 (t, 1H, 3 J(H,H) = 5.8 Hz, 5-OH); 5.39 (d, 1H, 3 J(H,H) = 4.0 Hz, 3'- 
10 OH); 6.55 (pt, 1H, 3 J(H,H) = 6.3 Hz, H-C(l')); 8.78 (s, 1H, N=CH); 9.08 (s, 1H, H- 

C(8)). Anal, calcd. for C I8 H 29 N 7 0 3 (391.5): C 55.23, H 7.47, N 25.05; found: C 55.36, H 
7.66, N 24.97. 



Examplg in 

7- [2-Deoxy- 5-0-(4,4'-dlmemoxytriphenymiemyl)-fr-D-erythro-pentofu^ -4- 
{[(di-n-butylammo)methyUdene]animo}-7H-iniidazo[4,5-d] [ 1,2,3] -triazine (10a). 
Compound 9c (390 mg, 1 mmol) was co-evaporated twice with pyridine, and the oily 
residue was dissolved in anhydrous pyridine (6 mL). Next, 4,4 , -dimethoxytriphenyl- 

20 methyl chloride (450 mg, 1.3 mmol) was added, and the reaction mixture was stirred 
at room temperature for 2 h. Thereupon, MeOH (0.2 mL) was added, and stirring 
was continued for 15 min. The reaction mixture was poured into 15 mL of an aq. 5% 
NaHC0 3 solution, and this was extracted twice with CH 2 C1 2 (30 mL, each). The 
combined extracts were dried over Na 2 S0 4 , evaporated, and the residue was adsorbed 

25 on silica gel. This was applied onto a silica gel 60H - column (4 X 14 cm) and 

chromatographed with a CH 2 Cl 2 -acetone gradient (0 -> 25% of acetone, total volume, 
600 mL). The nudeoside-containing fractions were pooled and evaporated to obtained 
compound 10a as solid foam (560 mg, 81%). TLC: (silica gel, CH 2 Cl 2 -acetone, 85:15): 
Rf 0.15. 'H-NMR ([D«] DMSO): 0.93 (t, 6H, CH 3 ); 1.34, 1.63, 3.75 (3m, 12H, -CH 2 -); 

30 2.95 (2m, 2H, H-C(2')); 3.51 (m, 2H, H 2 -C(5')); 3.63, 3.69 (2s, 6H, OCH 3 ); 4.01 (m, 
1H, H-C(4')); 4.59 (m, 1H, H-C(3')); 5.45 (d, 1H, 3 J(H,H) = 4.1 Hz, 3'-OH); 6.57 (pt, 
1H, 3 J(H,H) = 6.2 Hz, H-C(l')); 6.60 - 7.30 (m, 13H, phenyl-H); 8.71 (s, 1H, N=CH), 
9.07. (s, 1H, H-C(8)). Anal, calcd. for Q9H47N7O5 (693.8): C 67.51; H 6.83; N 14.13; 
found: C 67.15, H 6.82, N 14.13. 
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Example 11 

7- [2-Deoxy- 5-0-(44'-dimethoxytriphenylmethyl)-fi^ -4- 
{ [(di-n-butyiamino)methylidene] amino}-7H-imidazo[4 > 5-d][l > 2,3]-triazine--3 , -[(2- 

5 cyanoethyl)-N,N-diisopropyl phosphoramidite] (10b). To a solution of compound 10a 
(300 mg, 0.43 mmol) in anhydrous CH 2 C1 2 (20 mL) N,N-diisopropylethyiamine (145 
liL, 0.88 mmol) and cUoro-(2-cyanoethoxy)-N,N-dnsopropylaminophosphine (143 ^iL, 
0.62 mmol) were added under an Ar atmosphere. After stirring for 20 min at room 
temperature, a 5% aq. NaHC0 3 solution (15 mL) was added and the mixture was 

10 extracted with CH 2 C1 2 (2 x 30 mL). The organic layer was dried (Na 2 S0 4 )> filtered and 
evaporated. FC (silica gel, column 5 x 10 cm, CH 2 Cl 2 /acetone, 85:15) gave a mixture of 
diastereoisomers of the title compound (300 mg, 78%). TLC (silica gel, CH 2 Cl2/acetone, 
85:15): Rf 0.71, 0.80. 5I P-NMR (CDC1 3 ): 149.962, 150.223. 

15 Example 12 

5>_0-(4,4'-dimethoxytrityl)-6^ 

deoxyadenosine 3 > -[(2-Cyanoethyl)-N,N-(diisopropyl)]phosphoramidite (12) 

To a solution of 5 , -0-(4 > 4 , -dimethoxytrityl)-6-N-((di-n-bulylamino)melJiylene)-2-aza 

20 2 > -deoxyadenosine (300 mg, 0.43 mmol) in anh. CH 2 C1 2 (20 ml) (i-Pr) 2 EtN (145 pi, 0.88 
mmol) and chloro-(2-cyanoethoxy)(diisopropylamino)phosphine (143 pi, 0.62 mmol) 
were added. After stirring for 20 min at r. t., a 5% aq. NaHC0 3 solution (15 ml) was 
added and the mixture was extracted with CH 2 C1 2 (2 x 30 ml). The organic layer was 
dried (Na 2 SC>4), filtered and evaporated. FC (silica gel, column 5 x 10 cm, 

25 CH 2 Cl 2 /acetone, 85:15) gave a mixture of diastereoisomers NR-411 (300 mg, 77%). TLC 
(silica gel, CH^Wacetone, 85:15): R f 0.71, 0.80. 3l P-NMR (CDCI3): 149.962, 150.223. 

Example 13 

30 Synthesis of nucleic acid binding compounds using the monomers of Example 12 
The synthesis was performed as outlined in the General section. 
Example 14 

35 
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Determination of nucleic acids using probes according to Example 13 

Table 1. T m - Values and Thermodynamic Parameters of Duplex Formation of 
Oligonucleotides Containing 2-Aza-2'-deoxyadenosine. 



Oligonucleotide 


T m 
[°C] 


AH 
[kcal/mol] 


AS 

[cal/Kmol] 


AG 
[kcal/mol] 


5'-d(TAGGTCAATACT) 
3*-d(ATCCAGTTATGA) 


46 


-82 


-230 


-10.4 


5'-d(TAGGTC6ATACT) 
3'-d(ATCCAGTTATGA) 


42 


-85 


-245 


-9.2 


5 1 -d(TAGGTC6ATACT) 
3 1 -d( ATCCAGGTATGA) 


46 


-83 


-236 


-9.9 


5 1 -d(TAGGTC66TACT) 
3 1 -d( ATCCAGTTATG A ) 


37 


-76 


-219 


-7.7 


5'-d(TAGGTC66TACT) 
3 ' -d( ATCCAGAAATGA) 


25 


-49 


-141 


-5.7 


3 ' -d( ATCCAGCCATGA) 


20 


-41 


-113 


-5.5 


5'-d(TAGGTC66TACT) 
3'-d(ATCCAGGGATGA) 


46 


-74 


-206 


-10.1 


5 '-d(TAGGTCAATACT) 
3'-d(ATCCAGGGATGA) 


36 


-.47 


-127 


-7.4 


5'-d(TAGGTCGGTACT) 
3'-d(ATCCAGCCATGA) 


54 









10 mM Na-cacodylate, 100 mM Nad, 10 mM MgCl 2 , pH 7; 5 uM single strand 
concentration; 6: 2?A<i. 
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The two oligonucleotides 5'-d(TAGGTCAATACT) and 5'-d(AGTATTGACCTA) were 
constructed to form a stable hybrid with a T m value of 47°C. This duplex is used as a 
standard to study the influence of modified bases on the duplex structure and stability. 
As can be seen from Table 1 the replacement of one central dA - dT by a z 2 Ad - dT base 

5 pair reduces the T m of the duplex by 5°; the exchange of two base pairs reveals a decrease 
of the T m by 10°. The reduction of duplex stability is obviously independent from the 
position of base pair replacement: the duplex having two consecutive z 2 A<i - dT pairs, 
exhibits the same T m as the duplex in which the modified base pairs are separated by 
three regular ones. Duplex stability is linearly decreased further when the number of 

10 z 2 A<i - dT base pairs is increased; the oligonucleotide containing four modified pairs 
exhibits a T m of only 28°C. This result is in striking contrast to findings on 
oligonucleotides in which dA residues are replaced by 8-aza-2'-deoxyadenosine; here, 
the introduction of even four z 8 Aa residues instead of dA does not exert any influence 
on the duplex stability. 

15 

As can be seen from table 1, the TM of an oligonucleotide having at position 7 a G-C 
base pair (last line in table 1) has a TM of 54 °C. Replacement of two Cs in these base 
pairs by 2-azaadenine yields a TM of 46 °C (7 duplex). The same TM is present when 
these artificial base pairs are replaced by the natural base pair A-T (first duplex). 
20 Moreover, the duplex is having mixed 2-azaadenine/G and A-T base pair (third duplex). 
From table 1 it can be learned that replacement of one G-C base pair by an artificial 
base pair of the present invention reduces the TM by between 3 and 5 °C, preferably 4 
°C. 

25 The above described results were found for duplexes with antiparallel chain orientation. 
The same result, however, was also found for oligonucleotide duplexes with a parallel 
strand polarity. The chain orientation of naturally-occuring DNA is antiparallel (aps). 
This orientation can be turned to parallel when the duplex contains isoGd - dC and or 
isoMe 5 C d - dG base pairs (Helv. Chim. Acta. 1997, 80, 73-85). As the pairing of dA 

30 with dT is ambiguous any natural DNA can be hybridized in the parallel mode when 
the second strand contains the bases isoguanine, isocytosine, adenine and thymine. 
As an example, one duplex is given in Table 1. When in this duplex two dA - dT base 
pairs are replaced by z 2 Ad -dT, a reduction of the T m value by 10° is determined which 
is identical with the results for corresponding antiparallel oligonucleotide duplexes. 

35 
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The T m data listed in Table 1 display another interesting feature of the base pairing 
properties of z 2 Aa (2). Stimulated by the finding that replacement of a destabilizing 
central z 2 A<, - dT base pair by z 2 A«, - dG enhances the T m value of the oligomer back to 
the value of the unmodified duplex (T m 46°C), we investigated the duplex stabilities of 
5 oligomers containing mismatches. 

For this purpose oligodeoxynucleoudes were synthesized in which two central z 2 Ad 
residues are placed opposite to either two dT, dA, dC or dG residues. In all cases, except 
for z 2 Aa - dG - containing duplexes, the T m value is significantly decreased - most 
10 pronounced for the oligomer with two z 2 ^ - dC pairs. This oligonucleotide, however, 
exhibits almost the same value as the unmodified duplex. This prompted us to propose 
a z 2 Aa - dG base pair as shown in FIG 1 and FIG 6 (motif I). Thus, the present invention 
can also be used in assays where nucleic acids should be discriminated using 
mismatches. 



15 



The findings on the peculiar base pairing of 2-aza-2'-deoxyadenosine imply that this 
nucleoside exhibits similar pairing properties as 2'-deoxyisoguanosine (isoG d ), the more 
so as both show a similar hydrogen bonding donor-acceptor pattern when assuming a 
keto/H-N(3) tautomeric form of 2'-deoxyisoguanosine (FIG 6, motif II). Indeed, the 
20 latter forms a purine-purine base pair with 2'-deoxyguanosine in oligodeoxynucleotides 
with an antiparallel strand polarity but significantly weaker base pairs with dC and dT, 
and particularly with dA. 

With the aid of these results we anticipate that in parallel oriented oligonucleotides 
25 2-aza-2'-deoxyadenosine will form a base pair with isoG d (FIG 7, motif III) under 
neutral conditions as well as with protonated dC (FIG 7, motif V) in antiparallel 
arranged duplexes. On the other hand, with protonated 2'-deoxyisocytidine an 
antiparallel base pair should be formed following the structural motif VI depicted 
in FIG 7. 
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Claims: 



EPO- 

3 a -4 



899 



A nucleic acid binding compound comprising a backbone, said backbone having 
attached heterocyclic groups capable of base pairing to natural nucleobases at least 
one of said heterocyclic groups being one of the naturally occurring nucleobases 
characterized in that at least one other of said heterocyclic groups is a group of the 
general formula I 



Formula I 



N 



w 



10 



wherein 



15 



W is selected independently from X, Y and Z from the group consisting of N 
and CR 2 , 

Z is selected from the group consisting of N and C with the proviso that 



20 



if Z is N, then 

X independently from W and Y is selected from the group 

consisting of N and CR 3 , and 
Y independently from W and X is selected from the group 

consisting of N and CR 4 , 
and the bond between X and Y is a double bond and the bond 
between Y and Z is a single bond, and 



25 
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10 



15 



20 



X is NR 3 \ and 

Y is selected from the group consisting of N and CR 4 and 

the bond between Z and Y is a double bond and the bond between X 
and Y is a single bond, 

R\ R 2 , R 3 and R 4 are independenfiy selected from the group consisting of -H, - 

halogen, -OR 13 , -SR 19 , -(Q-QoMkyi, -(C 2 -C 10 )-alkenyl, -(C 2 -C 10 )-alkynyl, 

-NO2, -NRV, -cyano, and -G(=0)R H , 
R 11 is selected from the group consisting of -OH, -(Q-QO-alkoxy, -(Q-C22)- 

aryloxy, and NHR 12 , 
R , R 6 , R 12 , R 13 , R 19 and R 33 are selected independently from the group consisting 

of -H, -(C,-C 10 )-alkyl, -(C 2 -Ci 0 )-alkenyl, -(C 2 -C 10 )-alkinyl, -(Q-C22)-aryl, a 

protecting group and a reporter group, 
r and s are independently of each other an integer of 1 to 18, 
D is the position of attachment of the group to the rest of the nucleic acid 

binding compound, and 
said alkyl, alkenyl and alkynyi being unsubstituted or substituted by one or more 
moieties selected from the group consisting of -halogen, -SH, -S-fQ-QO-alkyl, - 
(Q-QO-alkoxy, -OH, -NR^ 6 , -COR 11 , -NH-CONR'R 6 , -NH-CSNR 5 R 6 and 
-[0-(CH 2 ) r ] s -NR 5 R 6 . 



2. The nucleic acid binding compound of claim 1, wherein the backbone comprises 
25 sugar and phosphate moieties. 



30 



A nucleic acid binding compound of claim 2, wherein the sugar configuration is 
selected from the group consisting of the a-D-, fi-D-, a-L- and 0-L- 
configurations. 

The nucleic acid binding compound of claim 3, wherein the sugar moiety is a 
2 , -deoxy-p-D-erythropentofuranosyl moiety. 
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5. The nucleic acid binding compound of claim 1, wherein R 1 is selected from the 
group consisting of -SH, -(Q-QO-alkoxy, -(C 2 -Q)-alkylmercapto, -NR5R 6 , F and 
N0 2 . 

5 6. The nucleic acid compound of claim 5, wherein R 1 is -NR^ 6 . 

7. The nucleic acid binding compound of any of claims 1-6, wherein the backbone 
comprises one or more moieties of the general formula II 

Formula II 



10 




wherein 



A is selected from the group consisting of O, S and N-(Ci-Qo)-alkyl, 
L is selected from the group consisting of oxy, sulfanediyl and -NR 22 -, 
T is selected from the group consisting of oxo, thioxo and selenoxo, 
U is selected from the group consisting of -OH, -O-reporter group, -SH, -S 
15 reporter group -SeH, -(Ci-C 10 )-alkoxy, (Ci-Cio)-alkyl, -(Q-Q^-aryl, -(Q- 

Q^-aryHQ-QoJ-alkyl, -NR^R 24 , and 

-0-(C 1 -C 10 )-alkyl-0-(C 1 -Cio)-alkyl-R 25 , or wherein -NR^R 24 can together 
with N be a 5-6-membered heterocyclic ring, 
V is selected from the group consisting of oxy, sulfanediyl or -NR 22 -, 
20 R M is selected from the group consisting of -H, -OH, -(Ci-Cio)-alkoxy, 

-(C2-Ci 0 )-alkenyloxy, -halogen, -azido, -O-allyl, -O-alkinyl, and -NH 2 , 
R 22 is independently selected from the group of-H and -(Ci-Cio)-alkyl, 
R 23 and R 24 are independently selected from the group consisting of -(Q-Qo)- 
alkyl, -(C,-C2o)-aryl, -(C6-Cu)-aryl-(Ci-C 10 )-alkyl, -(d-Q)-alkyl- 
25 [NHCCHaJdd-NR^R 27 and a reporter group, 
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R is selected from the group consisting of -H, -OH, -halogen, -amino, 

-(C,-C 18 )-alkylamino, -COOH, -CONH 2 and -COO(C 1 -C 4 )-alkyrl and a 
reporter group, 

R 26 and R 27 are independently selected from the group consisting from -H, 

-(Q-QO-alkyl, and -(C 1 -C 4 )-alkoxy-(C i -C 6 )-alkyl and a reporter group, 
c is an integer from 2 to 6, 
d is an integer from 0 to 6, and 
B is a moiety of formula I, 



wherein any alkyl, alkenyl and alkynyl can be substituted or unsubstituted. 

The nucleic acid binding compound of claim 1, wherein R 1 is -NH 2 . 

The nucleic acid compound of claim 1 containing at least one reporter group. 

The nucleic acid binding compound of claim 1, wherein the backbone comprises a 
moiety of the general formula III 



Formula III 




M 

I 

R15 



R14 



wherein 



A is selected from the group consisting of O, S and N-(C 1 -Q)-alkyl, 

M is selected from the group consisting of oxy, sulfanediyl, -NR 22 -, -(C,-Ci 0 )- 

alkyl-, or -CMQ-QoJ-alkyi-O-, and -S-(C 1 -C 10 )-alkyl-O- and -NR 22 - 

(C 1 -C 6 )-alkyl-0-, 

R 22 is selected from the group of-H, -(Q-C,o)-alkyi, a protecting group and a 
reporter group, 
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14 is selected from the group consisting of -H, -OH, -(Q-Cio)-alkoxy, 

-(C 2 -Ci 0 )-alkenyloxy, -(C 2 -Ci 0 )-alkynyioxy, -halogen, -azido, SH, -(Q-C 10 )- 
alkylmercapto and -NH 2> 

is selected from the group consisting of -H, -(Ci-C 6 ) -alkyl, -(Q-Qo)- 
alkenyl, -(C 2 -C 10 )-alkynyl, -(Q-QcO-alkyl-carbonyi, -(C 3 -Ci 9 )-alkenyi- 
carbonyl, -(C 3 -Ci 9 )-alkynyl-carbonyl, -(Q-C 14 )-aiyl-(Ci-Cio)-alkyl, a solid 
phase and a group of formula IV 



>15 



Formula IV 



U - P - R 29 
T 



wherein 

T is selected from the group consisting of oxo, thioxo and selenoxo, and 

U is selected from the group consisting of -OH, -O-reporter group, -SH, -SeH, 

-(Q-C 10 )-alkoxy, -(Q-Cio)-alkyl, -(Q-Q^-aryl, -(Q-C 14 )-aryl-(Ci-Cio)- 

alkyl, -NR^R 24 , and -0-(Ci-C i0 )-alkyl-0-(C 1 -Cio)-alkyl-R 25 , or wherein 
15 NR 23 R 24 can together with N be a 5-6-membered heterocyclic ring, 

R 23 and R 24 are independently selected from the group consisting of -(Q-Cio)- 

alkyl, -(Q-C 20 )-aryl, -(Q-CuJ-aryl-CQ-QoJ-alkyl, -(Ci-Q)-alkyl- 

[NH(CH 2 ) c ] d -NR 26 R 27 , 
R 25 is selected from the group consisting of -H, -OH, -halogen, -amino, 
20 -(Ci-Ci 8 )-alkylamino, -COOH, -CONH 2 and -COO(Ci-C 4 )-alkyi, 

R 26 and R 27 are independently selected from the group consisting from -H, 

-(Ci-Q)-alkyl, and -(Q-C^-alkoxy-Cd-QJ-alkyl 
R 29 is selected from the group consisting of -OR 30 and -SR 30 , 
R 30 is selected from the group consisting of -H, -(Q-Cio)-alkyi, -(C 2 -C i0 )- 
25 alkenyl, -(Cs-Q^-aryi, a protecting group, a solid phase and a reporter 

group 

B is the link to a moiety of formula I, 



wherein any alkyl, alkenyl and alkynyl can be substituted or unsubstituted. 



30 
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The nudeic acid binding compound of any of claims 1, 5 and 6, wherein said 
backbone comprises a moiety of the formula V 




wherein 



A is selected from the group consisting of O, S and N-(Ci-C 6 )-alkyl > 

M* is selected from the group consisting of oxy, sulfanediyl, -NR 22 -, -(Q-Cio)- 

alkyi, or -O-(Ci-Ci 0 )-alkyi-O-, and -S-(Ci-Ci 0 )-alkyl-O- and -NR 22 -(Q- 

Q)-alkyl-0-, 

R 22 is selected from the group of -H, a protecting group, a reporter group and 
-(Ci-Cio)-alkyl, 

R 14 is selected from the group consisting of -H, -OH, -(Q-CnO-alkoxy, 

-(C 2 -Ci<,)-alkenylox7, -(C 2 -Ci 0 )-alkynyloxy, -halogen, azido, -SH, -S-(Q- 
C6)-alkylmercapto and NH 2 , 

R 16 is selected from the group consisting of -H, -(Ci-Q)-alkyl, -(Cz-Qg)- 
alkenyl, -(C 2 -Ci 8 )-alkynyl, -(C 2 -C, 8 )-alkyl-carbonyl, -(C 3 -C 19 )-alkenyl- 
carbonyl, -(C 3 -Ci 9 )-alkynyl-arbonyl, -(C6-Ci 4 )-aryl-(Ci-Q)-alkyl, a 
protective group or a compound of formula IV 

Formula IV | 

U - P - R29 

T 

wherein 



T 
U 



is selected from the group consisting of oxo, thioxo and selenoxo, 

is selected from the group consisting of -OH, -SH, -SeH, -(C l -C 10 )-alkoxy, 

-(C,-C 10 )-alkyl, -(Q-C^-aryl, -(C 6 -C 14 )-aryl-(C 1 -C 1 o)-alkvl, -NR 2 ^ 24 , and 
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15- 



20 



-O^d-doi-alkyi-O-Cd-doJ-alkyl-R 25 , wherein NR^R 24 can together with 
N be a 5-6-membered heterocyclic ring, 
R 23 and R 24 are independently selected from the group consisting of -(Q-Qo)- 
alkyl, -(Ci-C2o)-aryl, -(Q-C M )-aryl-(C 1 -C,o)-alkyl, -(Q-Q)-alkyl- 
[NHCC^cJa-NR^R 27 , 
R 25 is selected from the group consisting of -H, -OH, -halogen, -amino, 

-(Ci-Ci 8 )-alkylamino, -COOH, -CONH 2 and -COO(C 1 -C 4 )-alkyl, 
R 26 and R 27 are independently selected from the group consisting from -H, 
-(C,-Q)-alkyl, and -(C ! -C 4 )-alkoxy-(C l -C 6 )-alkyl 
is selected from the group consisting of -OR 30 and -SR 30 , 
is selected from the group consisting of -H, -(Ci-Qo)-alkyl, -(Q-Qo)- 
alkenyl, -(C6-C2 2 )-aryl, a protecting group, a solid phase and a reporter 
group, and 

is the link to a moiety of formula I, 



->29 



-.30 



B 



wherein any alkyl, alkenyl and alkynyi can be substituted or unsubstituted. 

12. The compound of claim 1 1, wherein M' is O, R 16 is H and R 14 is selected from the 
group consisting of -H and -OH. 

13. A compound of formula VI 



Formula VI R! 6 



Ml 



M 

I 

R1S 



R14 



25 



wherein 



A is selected from the group consisting of O, S and N-CQ-QO-alkyl, 
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R u is selected from the group consisting of -H, -OH, -(Ci-Qo)-alkoxy, O- 
protecting group, S-protecting group, NH-protecting group, -(C2-C10)- 
alkenyloxy, -halogen, -azido, -SH, -(Q-C 6 )-alkylmercapto and -NH 2 , 

R 15 and R 16 are independendy selected from the group consisting of -H, -(Q-Cs)- 
alkyl, -(C2-C, 8 )-alkenyl, -(C 2 -Ci 8 )-alkynyl, -(Q-CigValkyl-carbonyl, 
-(C 3 -C 19 )-aIkenyl-carbonyl, -(C 3 -Ci 9 )-alkynyl-carbonyl, -(C6-Cu)-aryl- 
(Ci-Q)-alkyl, a protecting group or a compound of formula IV 

Formula IV | 

. U - P - R 29 



wherein 

T is selected from the group consisting of oxo, thioxo and selenoxo, 

U is selected from the group consisting of -OH, -SH, -SeH, -(Q-Cio)-alkoxy, 

-(Ci-C l0 )-alkyl, -(QrC 2 2)-aryl, -(C6-C u )-aryl-(Ci-Ci 0 )-aIkyl, -NR 2 ^ 24 , and 

-0-(C 1 -Cio)-alkyl-0-(Ci-Cio)-alkyl-R 25 , or wherein NR^R 24 can together 

with N be a 5-6-membered heterocyclic ring, 
R 23 and R 24 are independently selected from the group consisting of -(Q-Qo)- 

alkyl, -(d-C^-aryi, -(C 6 -Ci 4 )-ar>i-(C 1 -Cio)-allcyl, -(d-QJ-alkyl- 

[NH(CH 2 ) c ]d-NR 26 R 27 , 
R 25 is selected from the group consisting of -H, -OH, -halogen, amino, 

-(Q-dgJ-alkylamino, -COOH, -CONH 2 and COO(Q-C 4 )-alkyl, 
R 26 and R 27 are independently selected from the group consisting from -H, 

-(Q-QO-alkyl, and -(Ci-C^-alkoxy-Cd^-alkyl, 
R 29 is selected from the group consisting of -OR 50 and -SR 30 , 
R 30 is selected from the group consisting of -H, -(Ci-Qo)-alkyl, -(Q-Qo)- 

alkenyl, -(C6-C22)-aryl, a protecting group, a diphosphate and a reporter 

group, and 

M and M 1 are independently selected from the group consisting of oxy, 

sulfanediyl, -NR 22 , -(Q-CuO-alkyl, or -O-(Ci-Ci 0 )-alkyi-O-, and -S- 
(Ci-Qo)-alkyl-O- and -NR 22 -(C r C6)-alkyi-0-, 

R 22 is selected from the group of -H and -(Ci-Qo)-alkyl, and 
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B is a moiety of formula I, 



wherein any alkyl, alkenyl and alkynyl can be substituted or unsubstituted, and 
wherein at least one of R 15 and R 16 is not a group of formula IV with the proviso 
5 that 

MR 16 , MR 15 and R u are not each -OH if R 1 is -NH 2 and if either 

- WandXandYandZisN.or 

- WandXandZisNandYisCR 4 ,or 

- WandYandZisNandXisCR 3 . 

10 

14. A compound of formula VII 



Formula VII 




M R 14 

" R 18 



wherein 

15 

A is selected from the group consisting of O, S and N-(Ci-Q)-alkyl, 
M and M' are independently selected from the group consisting of oxy, 

sulfanediyl, -NR 22 , -(C r C 10 )-alkyl, or -O-(Ci-Ci 0 )-alkyl-O-, and -S- 

(Ci-Cio)-alkyl-O- and -NR 22 -(C 1 -C 6 )-alkyl-0-, 
20 R 22 is selected from the group of -H and -(Q-C^-alkyl, 

R 14 is selected from the group consisting of -H, -OR 31 , -(Ci-C 10 )-alkoxy, 

-(Q-CioJ-alkenyloxy, -(C 2 -Cio)-alkynyloxy, -halogen, -azido NHR 31 , SR 31 

and -NH 2 , 

R 31 is a protecting group or a reporter group, 
25 R 32 and R 17 are independently selected from the group consisting of -H, -(Ci -Qo)- 

alkyl, -(Q-CioJ-alkenyl and -(C6-C22)-aryl, 
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R is selected from the group consisting of substituted or unsubstituted -(Q- 
C6)-alkyl, unsubstituted -(Q-QO-attcoxy or -(Q-QJ-alkoxy substituted oi 
or more times by a group selected from the group consisting of -halogen, 
p-nitroaryloxy and -cyano, and 

B is a group of formula I 



Formula I ,R 1 




wherein 

10 

W is selected independently from X, Y and Z from the group consisting of N 
and CR 2 , 

Z is selected from the group consisting of N and C with the proviso that 
15 - ifZisN,then 

X independendy from W and Y is selected from the group 
consisting of N and CR 3 , and 

Y independendy from W and X is selected from the group 
20 consisting of N and CR 4 , 

and the bond between X and Y is a double bond and the bond 
between Y and Z is a single bond, and 
- ifZisCthen 

25 X isNR 33 ,and 

Y is selected from the group consisting of N and CR 4 and 
the bond between Z and Y is a double bond and the bond between X 

and Y is a single bond, 
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R , R , R and R are independendy selected from the group consisting of -H, - 

halogen, -OR 13 , -SR 19 , -(C,-Qo)-alkyl, -(C 2 -C 10 )-alkenyl, -(C 2 -Ci 0 )-alkynyl, 

-N0 2 , -NR^ 6 , -cyano, and -C(=0)R u , 
R 11 is selected from the group consisting of -OH, -(Ci-Q)-alkoxy, -(Q-C 22 )- 

aryloxy, and NHR 12 , 
R 3 ,R°,R",R",R iS andR 33 are selected independently from the group consisting 

of -H, -(Q-QoJ-alkyl, -(C 2 -Ci 0 )-alkenyl, ~(C 2 -C 10 )-al]rinyi, -(Q5-C 22 )-aryl, a 

protecting group and a reporter group, 
r and s are independently of each other an integer of 1 to 18, 
D is the position of attachment of the group to the rest of the nucleic acid 

binding compound, and 
alkyl, alkenyi and alkynyl being unsubstituted or substituted by one or more 
moieties selected from the group consisting of -halogen, -S-CQ-C^-alkyl, -(Q- 
Q)-alkoxy, -NR 5 R 6 , -CO-R 11 , -NH-CO-NR 5 R 6 , -NH-CSNR 5 R 6 and -[0-(CH 2 ) r ] s - " 
NR^ 6 , 

with the proviso that at least one of R 5 and R 6 of -NR 5 R 6 is a protecting group. 

-A compound of claim 14, wherein R 1 is not OR 31 . 

A compound of claim 14, wherein said group of formula I contains at least one 
reporter group. 

A compound of any of claims 14 and 15, wherein said group of formula I is 
selected from the group consisting of groups of formula I, wherein either 

W is N, Z is N, Y is N and X is CR 3 , or 
W is N, Z is C, Y is N and X is CR 3 , or 
WisN,ZisN,YisNandXisN. 

A compound of claim 5 with the proviso that R 31 is not H. 

The binding product of at least one nucleic acid binding compound of any of 
claims 1 to 11 and a nucleic acid, the nucleic acid binding compound and the 
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nucleic acid being bound to each other by base pairing in parallel or antiparallel 
orientation. 



20. A method for the determination of a nucleic acid comprising the steps 

5 

providing a sample suspected to contain said nucleic acid, 
providing a nucleic acid binding compound of claim 1, which is essentially 
complementary to a part or all of said nucleic acid, 
contacting said sample with said nucleic acid binding compound under 
10 conditions for binding said nucleic acid binding compound to said nucleic 

acid, 

determining the binding product formed from said nucleic acid and said 
nucleic acid binding compound as a measure of the presence of said nudeic 
acid. 

15 

21. A method of claim 20, wherein any groups of formula I in said nucleic acid 
binding compound of claim 1 is located in said compound as to base pair to a G 
moiety in said nucleic acid. 

20 22. Use of a 2-azapurine in a nucleic acid binding compound as a substitute for 
cytosine. 



23. The use of a 2-azapurine in hybridization reactions of probes with nucleic acids as 
a base at a position of the probe base pairing with G in the nucleic acid. 

25 

24. A method for the chemical synthesis of a compound of any of claims 1 to 13 using 
activated subunits, wherein said subunit contains at least one group of formula I. 

25. A method of claim 24, wherein at least one subunit is a compound of any of 
30 claims 13 to 21. 



26. A method for the enzymatic synthesis of a compound of any of claims 1 to 13, 

comprising reacting a triphosphate subunit with a primer using a nucleic acid as a 
template for the elongation of the primer, wherein the triphosphate subunit 
35 contains a heterocyclic group of formula I. 
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A method according to claim 26, wherein said triphosphate subunit has the 
formula VIII 



Formula VIII 



PPP-O 



B 




OH R 14 



wherein 



PPP is a triphosphate group, 

R 14 is selected from the group consisting of -H, -OH, -(Ci-Qo)-alkoxy, 

-(Q-Cio)-alkenyloxy, -(C 2 -Ci 0 )-alkynyloxy halogen, -azido andNH 2 > and 
B is a group of formula I, 

with the proviso that R 14 is not OH if B is 2-azaadenine. 

A method for the determination of the presence or absence of nucleic acids each 
comprising a particular sequence in a sample comprising the steps 

contacting said sample with a solid phase having immobilized on its surface 
nucleic acid binding compounds each containing a sequence 
complementary to one of said particular sequences of said nucleic acids, 

determining on said solid phase the formation of hybrids containing a 
nucleic acid with a particular sequence and the nucleic acid binding 
compound containing the complementary sequence 

characterized in that said at least one of said nucleic acid binding compounds is a 
compound according to any of claims 1 to 13. 



A method for the determination of the presence, absence or amount of a nucleic 
acid in a sample comprising the steps: 
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• 



10 



providing primers, a first primer being essentially complementary to a first 
binding sequence of said nucleic acid, and the second primer being 
essentially complementary to a binding sequence of a complement of this 
nucleic acid, and a probe being complementary to the nucleic acid or the 
complement thereof between the binding sequences of said primers, said 
probe being labelled at different subunits by at least two different reporter 
groups, 

subjecting the sample with said primers and said probe under conditions 
favouring extention of said primers and separating said reporter groups 
from each other by disintegrating the probe, and 

determining the extent of disintegration of the probe via at least one of said 
reporter groups, 



15 



characterized in that at least one of said primer or/and probe are a nucleic acid 
binding compound according to any of claims 1-13. 



30. A compound of the general formula VIII 

Formula VIII 

PPP -O 




OH R 14 



20 



wherein 



25 



PPP is a triphosphate group, 

R 14 is selected from the group consisting of -H, -OH, -(Ci-Cio)-alkoxy, 

-(C2-Ci 0 )-alkenyloxy, -(C 2 -Ci 0 )-alkynyloxy halogen, -azido and NH 2 , and 
B is a group of formula I, 

with the proviso that R 14 is not OH if B is 2-azaadenine. 



31. A compound of claim 30, wherein -M-R 16 is a triphosphate group and -M-R 15 is 
-OH. 
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32. A compound of claim 3 1 , wherein R 14 is -OH. 

33. A compound of claim 30, wherein R 14 is -H and R 1 is not OH. 

34. A compound of general formula IX 



Formula IX 




R 18 N M' 



wherein 



10 



A is selected from the group consisting of O, S and N-(Ci-C 6 )-alkyl > 
M and M 1 are independently selected from the group consisting of oxy, 

sulfanediyl, -NR 22 , -(Q-Qo)-alkyl, or -O-(Q-C 10 )-alkyl-O-, and -S- 

(Q-Cio)-alkjd-O- and -NR 22 -(Ci-Q)-alkyl-0-, 
15 R 22 is selected from the group of -H and -(Q-Cio)-alkyl, 

R 14 is selected from the group consisting of -H, -OR 31 , -(Ci-Cio)-alkoxy, 

-(Q-Cio)-alkenyloxy, -(Q-C^-alkynyloxy, -halogen, -azido NHR 31 , SR 31 

and -NH 2 , 

R 31 is a protecting group or a reporter group, 
20 R 32 and R 17 are independently selected from the group consisting of -H, -(Q-Qo)- 

alkyl, -(C 2 -Qo)-alkenyl and -(QrC 22 )-aryl, 
R 18 is selected from the group consisting of substituted or unsubstituted -(Q- 
QJ-alkyl, unsubstituted -(Q-CaJ-alkoxy or -(Ci-Qj)-alkoxy substituted one 
or more times by a group selected from the group consisting of -halogen, 
25 p-nitroaryloxy and -cyano, and 

B is a group of formula I 
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Formula I 



R 1 



N 

! 

N 




X 



w 



wherein 

W is selected independently from X, Y and Z from the group consisting of N 
andCR 2 , 

Z is selected from the group consisting of N and C with the proviso that 
ifZisN, then 



10 



15 



X independently from W and Y is selected from the group 

consisting of N and CR 3 , and 
Y independently from W and X is selected from the group 

consisting of N and CR 4 , 
and the bond between X and Y is a double bond and the bond 
between Y and Z is a single bond, and 



if Zis C, then 



20 



25 



X is NR 33 , and 

Y is selected from the group consisting of N and CR 4 and 

the bond between Z and Y is a double bond and the bond between X 
and Y is a single bond, 

R , R , R and R are independently selected from the group consisting of -H, - 

halogen, -OR 13 , -SR 19 , -(Ci-Ci 0 )-alkyl, -(C 2 -Ci 0 )-alkenyl, -((VQcO-alkynyi, 
-N0 2 , -NR'R 6 , -cyano, and -C(=0)R n , 

R" is selected from the group consisting of -OH, -(C 1 -C 6 )-alkoxy, -(QS-C22)- 
aryloxy, and NHR 12 , 
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R 5 ,R 6 ,R 12 ,R 13 ,R 19 andR 33 are selected independently from the group consisting 
of -H, -(Ci-Cio)-alkyl, -(C 2 -C l0 )-alkenyl, -(C 2 -C 10 )-alldnyl, -(Q-C^-aryl, a 
protecting group and a reporter group, 
r and s are independently of each other an integer of 1 to 18, 
D is the position of attachment of the group to the rest of the nucleic acid 

binding compound, and 
alkyl, alkenyl and alkynyl being unsubstituted or substituted by one or more 
moieties selected from the group consisting of -halogen, -S-(Ci-C6)-alkyi, -(Q- 
C 6 )-alkoxy -NR 5 R 6 , -CO-R 11 , -NH-CO-NR^ 6 , -NH-CSNR^ 6 - and -[O- 
(CH 2 ) r ] s -NR 5 R 6 , 

with the proviso that at least one of R 5 and R 6 of -NR^ 6 is a protecting group. 



35. 



A compound of general formula X 

Formula X 




M R 14 



H- 



25 



wherein 

M and M' are independently selected from the group consisting of oxy, 

sulfanediyl, -NR 22 , -(Ci-Cio)-alkyl, or -O-(Ci-Ci 0 )-alkyl-O-, and -S- 
(Ci-Cio)-alkyl-O- and -NR 22 -(Ci-C 6 )-alkyl-0-, 
is selected from the group of -H and -(Ci-Cio)-alkyl, 
is selected from the group consisting of -H, -OR 31 , -(Ci-Cio)-alkoxy, 
-(C2-Cio)-alkenyloxy, -(Q-Cio)-alkynyloxy, -halogen, -azido NHR 31 , SR 31 
and -NH 2 , 

is a protecting group or a reporter group, 
is a group of formula I 



R 22 
R 14 



R 31 
B 
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Formula I R 1 

w z 



Y 



wherein 

W is selected independently from X, Y and Z from the group consisting of N 
and CR 2 , 

Z is selected from the group consisting of N and C with the proviso that 
ifZisN.then 

X independently from W and Y is selected from the group 
consisting of N and CR 3 , and 

Y independently from W and X is selected from the group 
consisting of N and CR 4 , 

and the bond between X and Y is a double bond and the bond 
between Y and Z is a single bond, and 

if Z is C> then 

X isNR 33 ,and 

Y is selected from the group consisting of N and CR 4 and 
the bond between Z and Y is a double bond and the bond between X 

and Y is a single bond, 



R 1 , R 2 , R 3 and R 4 are independently selected from the group consisting of -H, - 

halogen, -OR 13 , -SR 19 , -(Q-C 10 )-alkjrl, -(Q-C^-alkenyl, -(Q-C 10 )-alkynyl, 
-N0 2 , -NR^ 6 , -cyano, and -C(=0)R u , 
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R 11 is selected from the group consisting of -OH, -(Ci-C 6 )-alkoxy, -(C 6 -C 22 )- 

aryloxy, and NHR 12 , 
R 5 , R 6 , R 12 , R 13 , R 19 and R 33 are selected independently from the group consisting 

of -H, -(Q-Cio)-alkyi, -(C 2 -Qo)-alkenyl, -(C 2 -Ci 0 )-alkinyl, -(Q-C^-aryl, a 
5 protecting group and a reporter group, 

r and s are independently of each other an integer of 1 to 18, 

D is the position of attachment of the group to the rest of the nucleic acid 

binding compound, and 
alkyl, alkenyl and alkynyl being unsubstituted or substituted by one or more 
10 moieties selected from the group consisting of -halogen, -S-(Q-C6)-alkyl, 

-(Q-Q)-alkoxy, -NR^ 6 , -CO-R 11 , -NH-CO-NR 5 R 6 , -NH-CSNR^ 6 - and -[O- 
(CH 2 ) r ] s -NR 5 R 6 . 



15 
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3tt Aug. 1999 

Abstract of the disclosure 



5 Within oligonucleotides 2-azapurine and especially 2-azaadenine bases form specifically 
base pairs with guanine. This base pair is of analogous stability as an adenine-thymine 
but less stable than a guanine-cytosine base pair. Therefore, the incorporation of 2- 
azaadenine residues into oligonucleotides instead of cytosine leads specifically to 
hybridization complexes with nucleic acids with homogenous stability. This is useful for 

10 the adaptation of the stabilities of different oligonucleotide sequences in all kinds of 
hybridization techniques, for example in oligomer chip technology. 
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Figure 2 




2: R=H 
11:R=P,0 9 H 4 



<MeO) 2 TrO' 



N=CHN(n-Bu) 2 
O 




N=CHN(n-Bu) 2 



O-P-O NHEta 
H 
12 




(MeO) 2 TrO 



NCCH 2 CH 2 0^ N(!-Pr) 2 
10 b 
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©RAW 



Figure 3 




purine numbering systematic numbering 
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DRAW a 




Figure 4 



CICH 2 CHO, 

1M aq. NaOAc buffer, 

pH4-5 

1 » 

70% 



NBS: N-bromosuccinimide 




7 
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Figure 5 



HNBz 



.N 



8 



N ^CHNR 2 




N ^CHN(n-Bu) 2 




(MeOfeTrO 



9a : R = C^ 
9b : R = i-butyl. 
9c : R = n-butyl. 



10a : R = H 

10b : R = -P[(i-Pr 2 N][0(CH 2 ) 2 CN] 



Figure 6 




dG z 2 ^ dG isoG d 



Watson-Crick mode 

motif I, antiparallei stranded 



Watson-Crick mode 

motif II, antiparallei stranded 




reverse Watson-Crick mode reverse Watson-Crick mode 

motif III, parallel stranded motif IV, parallel stranded 




isoC d HT z 2 ^ dCrf z 2 /^, 

Watson-Crick mode reverse Watson-Crick mode 

motif VI, antiparallel stranded motif V, parallel stranded 
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